WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)



(51) International Patent Classification:

C07K 14/705, 14/71, G06F 17/50, 19/00
// 159:00 



A1



(11) International Publication Number: WO 99/62955

(43) International Publication Date: 9 December 1999 (09.12.99)

(21) International Application Number: PCT/AU99/00420

(22) International Filing Date: 31 May 1999 (31.05.99)



(30) Priority Data:
PP3804    29 May 1998 (29.05.98)    AU



(71) Applicants (for all designated States except US): BIOMOLECULAR
RESEARCH INSTITUTE LIMITED [AU/AU]; 343 Royal Parade,
Parkville, VIC 3052 (AU). LUDWIG INSTITUTE FOR CANCER
RESEARCH [AU/AU]; Entrance 5, Level 6, Corner Grattan Street
& Royal Parade, Parkville, VIC 3052 (AU). COMMONWEALTH
SCIENTIFIC AND INDUSTRIAL RESEARCH ORGANISATION [AU/AU];
Limestone Avenue, Campbell, ACT 2601 (AU).

(72) Inventors; and

(75) Inventors/Applicants (for US only): ELLEMAN, Thomas,
Charles [AU/AU]; 12 Swan Avenue, Westmeadows, VIC
3049 (AU). EPA, Vidanagamage, Chandana [LK/AU];
5/212 The Avenue, Parkville, VIC 3052 (AU). GARRETT,
Thomas, Peter, John [AU/AU]; 2 Gray Street, Brunswick,
VIC 3055 (AU). JORISSEN, Robert, Nicholas [AU/AU];
138 Bloomfield Road, Keysborough, VIC 3173 (AU). LOU,
Meizhen [AU/AU]; 10 Roma Street, Scoresby, VIC 3179
(AU). BURGESS, Antony, Wilks [AU/AU]; 32 Range
Street, Camberwell, VIC 3124 (AU). McKERN, Neil,
Moreton [AU/AU]; Como Road, Lilydale, VIC 3170 (AU).
TREUTLEIN, Herbert, Rudolf [DE/AU]; 11 George Street,
Moonee Ponds, VIC 3039 (AU). WARD, Colin, Lesley
[AU/AU]; 903 Rathdowne Street, Carlton, VIC 3054 (AU).

(74) Agent: F.B. RICE & CO.; 605 Darling Street, Balmain, NSW
2041 (AU).



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BE, BG,
BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB,
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG,
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK,
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI,
SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZA,
ZW, ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ,
UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD,
RU, TJ, TM), European patent (AT, BE, CH, CY, DE, DK,
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI
patent (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR,
NE, SN, TD, TG).



Published

With international search report.



(54) Title: METHOD OF DESIGNING AGONISTS AND ANTAGONISTS TO EGF RECEPTOR FAMILY

(57) Abstract

The present invention relates to a method of designing compounds able to bind to a molecule of the EGF receptor family and to
modulate the activity mediated by the receptor molecule based on the 3-D structure coordinates of the EGF receptor crystal of Figure 6.







METHOD OF DESIGNING AGONISTS AND ANTAGONISTS TO 
EGF RECEPTOR FAMILY 

Field of the Invention 

This invention relates to the field of epidermal growth factor (EGF)
receptor structure and EGF receptor/ligand interactions. In particular, it
relates to the field of using the EGF receptor structure to select and screen for
ligands of the EGF receptor.

Background of the Invention 

Epidermal growth factor is a small polypeptide cytokine that
stimulates marked proliferation of epithelial tissues and is a member of a
larger family of structurally related cytokines such as transforming growth
factor α (TGFα), amphiregulin, betacellulin, heparin-binding EGF and some
viral gene products. Abnormal EGF family signalling is a characteristic of
certain cancers (Soler, C. & Carpenter, G., 1994, In Nicola, N. (ed) "Guidebook
to Cytokines and their Receptors", Oxford Univ. Press, Oxford, pp194-197;
Walker, F. & Burgess, A. W., 1994, In Nicola, N. (ed) "Guidebook to Cytokines
and their Receptors", Oxford Univ. Press, Oxford, pp198-201).

The epidermal growth factor receptor (EGFR) is the cell membrane
receptor for EGF (Ullrich, A., and Schlessinger, J. (1990) Cell 61, 203-212).
The EGFR also binds other ligands that contain amino acid sequences
classified as the EGF-like motif. Among these ligands, the three-dimensional
structures of EGF and TGFα have been determined by NMR (Montelione,
G.T., Wuthrich, K., Nice, E.C., Burgess, A.W. and Scheraga, H.A. (1986)
PNAS 83(22): 8594-8; Campbell, I.D., Cooke, R.M., Baron, M., Harvey, T.S.,
and Tappin, M.J. (1989) Prog. Growth Factor Res. 1, 13-22). Upon binding of
the ligand to the extracellular domain, the EGFR undergoes dimerization,
which eventually leads to the activation of its cytoplasmic protein tyrosine
kinase (Ullrich, A., and Schlessinger, J. (1990) Cell 61, 203-212). The EGFR is
also known as the ErbB-1 receptor and belongs to the type I family of receptor
tyrosine kinases (Ullrich, A., and Schlessinger, J. (1990) Cell 61, 203-212).
This group also includes the ErbB-2, ErbB-3 and ErbB-4 receptors. The ligand
of ErbB-2 is still unknown but it is clear that heregulin binds to ErbB-3 and
ErbB-4 (Plowman, G.D., Green, J.M., Culouscou, J.M., Carlton, G.W.,
Rothwell, V.M., and Buckley, S. (1993) Nature 366, 473-475). One of the
heregulins is known as neuregulin or NDF and contains an EGF-like
sequence that was found to fold into an EGF-like fold by NMR (Nagata, K.,
Kohda, D., Hatanaka, H., Ichikawa, S., Matsuda, S., Yamamoto, T., Suzuki,
A., and Inagaki, F. (1994) EMBO J. 13, 3517-3523 and Jacobson, N.E., Abadi,
N., Sliwkowski, M.X., Reilly, D., Skelton, N.J., and Fairbrother, W.J. (1996)
Biochemistry 36, 3402-3417).

The type II family of receptor tyrosine kinases consists of the insulin
receptor (INSR), the insulin-like growth factor I receptor (IGF-1R), and the
insulin receptor-related receptor (Ullrich, A., and Schlessinger, J. (1990) Cell
61, 203-212). Although the type II receptors consist of four chains (α2β2),
the extracellular portions of the receptors from the two families, as well
as the tyrosine kinase portions, share significant sequence homology,
suggesting a common evolutionary origin (Ullrich, A., and Schlessinger, J.
(1990) Cell 61, 203-212, and Bajaj, M., Waterfield, M.D., Schlessinger, J.,
Taylor, W.R., and Blundell, T. (1987) Biochim. Biophys. Acta 916, 220-226).

The 621 amino acid residues of the extracellular domain of the human
EGFR (sEGFR) can be subdivided into four domains as follows: L1, S1, L2
and S2, where L and S stand for "large" and "small" domains, respectively
(Bajaj, M., Waterfield, M.D., Schlessinger, J., Taylor, W.R., and Blundell, T.
(1987) Biochim. Biophys. Acta 916, 220-226, see Fig. 2). The L1 and L2
domains are homologous, as are the S1 and S2 domains.

Ligand-induced dimerization was first reported for the EGF receptor
(Schlessinger, J. (1980) Trends Biochem Sci 13, 443-447) and now is widely
accepted as a general mechanism for the transmission of growth stimulatory
signals across the cell membrane. Although many biochemical experiments
have been performed to reveal the molecular mechanism of receptor
dimerization (Lemmon, M.A., Bu, Z., Ladbury, J.E., Zhou, M., Pinchasi, D.,
Lax, L., Engelman, D.M., and Schlessinger, J. (1997) EMBO J. 16, 281-294 and
Tzahar, E., Pinkas-Kramarski, R., Moyer, J.D., Klapper, D.N., Alroy, L.,
Levkowitz, G., Shelly, M., Henis, S., Eisenstein, M., Ratzkin, B.J., Sela, M.,
Andrews, G.C., and Yarden, Y. (1997) EMBO J. 16, 4938-4950 and Lax, L.,
Mitra, A.K., Ravera, C., Hurwitz, D.R., Rubinstein, M., Ullrich, A., Stroud,
R.M., and Schlessinger, J. (1991) J. Biol. Chem. 266, 13828-13833), the
molecular mechanism by which monomeric ligands induce dimerization is
still unknown for members of the EGFR family. Single particle averaging of
electron microscopic images suggests that the overall shape of the sEGFR is
four-lobed and doughnut-like (Lax, L., Mitra, A.K., Ravera, C., Hurwitz, D.R.,
Rubinstein, M., Ullrich, A., Stroud, R.M., and Schlessinger, J. (1991) J. Biol.
Chem. 266, 13828-13833). Small angle x-ray scattering also indicates that the
sEGFR is a flattened sphere with long diameters of 110 Å and a short
diameter of 20 Å (Lemmon, M.A., Bu, Z., Ladbury, J.E., Zhou, M., Pinchasi,
D., Lax, L., Engelman, D.M., and Schlessinger, J. (1997) EMBO J. 16, 281-294).

The crystallization of sEGFR in complex with EGF has been published
(Günther, N., Betzel, C., and Weber, W. (1990) J. Biol. Chem. 265, 22082-22085;
Degenhardt M., Weber W., Eschenburg S., Dierks K., Funari SS., Rapp
G. and Betzel C. (1998) Acta Crystallogr. D Biol. Crystallogr. 54:999-1001), but
the structure has not yet been reported, despite a decade of effort by many
groups.

One EGF receptor ligand, TGF-α, has been observed to be overproduced
in keratinocyte cells which are subject to psoriasis (Turbitt, M.L. et al., 1990,
J. Invest. Dermatol. 95(2), 229-232; Higashiyama, M. et al., 1991, J.
Dermatol., 18(2), 117-119; Elder, J.T. et al., 1990, 94(1), 19-25). The
overproduction of at least one other EGF receptor ligand, amphiregulin, has
also been implicated in psoriasis (Piepkorn, M. 1996, Am. J. Dermatopath.,
18(2), 165-171). Molecules that inhibit the EGF receptor have been shown to
inhibit the proliferation of both normal keratinocytes (Dvir, A. et al., 1991, J.
Cell Biol., 113(4), 857-865) and psoriatic keratinocytes (Ben-Bassat, H. et al.,
1995, Exp. Dermatol., 4(2), 82-88). These findings indicate that EGF receptor
antagonists may be useful in the treatment of psoriasis.

Many cancer cells express constitutively active EGFR (Sandgreen, E.
P., et al., 1990, Cell, 61:1121-135; Karnes, W. E. J., et al., 1992,
Gastroenterology, 102:474-485) or other EGFR family members (Hynes, N.
E., 1993, Semin. Cancer Biol. 4:19-26). Elevated levels of activated EGFR
occur in bladder, breast, lung and brain tumours (Harris, A. L., et al., 1989, In
Furth & Greaves (eds) The Molecular Diagnostics of Human Cancer. Cold
Spring Harbor Lab. Press, CSH, NY, pp353-357). Antibodies to EGFR can
inhibit ligand activation of EGFR (Sato, J. D., et al., 1983, Mol. Biol. Med.
1:511-529) and the growth of many epithelial cell lines (Aboud-Pirak E., et
al., 1988, J. Natl Cancer Inst. 85:1327-1331). Patients receiving repeated doses
of a humanised chimeric anti-EGFR monoclonal antibody (Mab) showed
signs of disease stabilization. The large doses required and the cost of
production of humanised Mab are likely to limit the application of this type of
therapy. These findings indicate that EGF receptor antagonists will be
attractive anticancer agents.

Summary of the Invention

The present inventors have now obtained three-dimensional structural
information concerning the epidermal growth factor receptor (EGFR). This
structural information was obtained by comparative modelling based on the
three-dimensional structure of the IGF-1 receptor as described in
PCT/AU98/00998. The information presented in the present application can
be used to predict the structure of related members of the EGF receptor
family, and to develop specific ligands of members of the EGF receptor
family for therapeutic applications.

Accordingly, in a first aspect the present invention provides a method
of designing a compound which binds to a molecule of the EGF receptor
family and modulates an activity mediated by the molecule, which method
comprises the step of assessing the stereochemical complementarity between
the compound and a topographic region of the molecule, wherein the
molecule is characterised by

(i) amino acids 1-621 of the EGF receptor positioned at atomic
coordinates substantially as shown in Figure 6;

(ii) one or more subsets of said amino acids related to the coordinates
shown in Figure 6 by whole body translations and/or rotations; or

(iii) amino acids present in the amino acid sequence of a member of
the EGF receptor family, which form an equivalent three-dimensional
structure to that of the receptor site defined by amino acids 1-621 of the EGF
receptor positioned at atomic coordinates substantially as shown in Figure 6.
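
Item (ii) above covers coordinate sets that differ from those of Figure 6 only by a whole body (rigid-body) movement. As a minimal sketch of what that means in practice, the Python below rotates and translates a few coordinates and collects the interatomic distances, which such a movement leaves unchanged. The function names and the sample coordinates are illustrative assumptions; none of them is taken from the specification or from Figure 6.

```python
import math


def rotation_z(angle_rad):
    """Return a 3x3 matrix for a rotation about the z axis."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def rigid_transform(coords, rotation, translation):
    """Apply x' = R x + t to every (x, y, z) coordinate."""
    moved = []
    for x, y, z in coords:
        moved.append(tuple(
            rotation[i][0] * x + rotation[i][1] * y + rotation[i][2] * z
            + translation[i]
            for i in range(3)
        ))
    return moved


def pairwise_distances(coords):
    """Return every interatomic distance in a fixed order."""
    return [
        math.dist(a, b)
        for i, a in enumerate(coords)
        for b in coords[i + 1:]
    ]


# Hypothetical coordinates in angstroms (NOT from Figure 6).
atoms = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.2, 0.3)]

# Rotate by 90 degrees about z, then translate.
moved = rigid_transform(atoms, rotation_z(math.pi / 2), (10.0, -5.0, 2.0))

# The internal geometry is the same before and after the move.
before = pairwise_distances(atoms)
after = pairwise_distances(moved)
```

Comparing `before` and `after` element by element (within a small numerical tolerance) is one simple way to confirm that two coordinate sets describe the same structure in different frames of reference.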

In a preferred embodiment of the first aspect, the topographic region of
the molecule is defined by amino acids 1-475 of the EGF receptor, or an
amino acid sequence which forms an equivalent three-dimensional structure
to that of the region defined by amino acids 1-475 of the EGF receptor
positioned at atomic coordinates substantially as shown in Figure 6.

In a further preferred embodiment of the first aspect, the topographic
region of the molecule is defined by amino acids 313-621 of the EGF receptor,
or an amino acid sequence which forms an equivalent three-dimensional
structure to that of the region defined by amino acids 313-621 of the EGF
receptor positioned at atomic coordinates substantially as shown in Figure 6.

The phrase "EGF receptor family" includes, but is not limited to, the
EGF receptor, ErbB2, ErbB3 and ErbB4. In general, EGF receptor family
molecules show similar domain arrangements and share significant sequence
identity, preferably at least 40% identity.
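
The specification does not state how sequence identity is to be calculated. One common convention, assumed here purely for illustration, counts identical residues over the aligned positions at which neither sequence has a gap. The function name and the toy sequences below are hypothetical and are not receptor sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns where neither sequence has a gap.

    Both inputs must come from the same pairwise alignment, so they must
    have equal length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Toy aligned sequences (hypothetical).
identity = percent_identity("ACDE-FG", "ACQEHFG")
meets_threshold = identity >= 40.0
```

Other conventions, such as dividing by the length of the shorter sequence or by the full alignment length, give different percentages, so the denominator should be stated whenever a threshold such as 40% is applied.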

The EGF receptor molecule defined in the first aspect of the present
invention is depicted in Figure 5. The fragment comprising residues 1-475 of
the receptor comprises the L1, S1 and L2 domains of the ectodomain of the
EGF receptor. At the centre of this structure is a cavity, bounded by all three
domains, of sufficient size to accommodate a ligand molecule.

The fragment comprising residues 313-621 comprises the L2 and S2
domains, which are positioned such that they form a "corner" structure. It is
envisaged that this corner structure provides a further binding site for ligands
of EGF receptor family members.

By "stereochemical complementarity" we mean that the substance or a
portion thereof correlates, in the manner of the classic "lock-and-key"
visualisation of ligand-receptor interaction, with the cavity in the receptor
site.

In a preferred embodiment of the first aspect of the present invention,
the method further involves selecting or designing a compound which has
portions that match residues positioned on the surface of the receptor site as
depicted in Figures 7, 8 and 9. By "match" we mean that the identified
portions interact with the surface residues, for example, via hydrogen
bonding or by enthalpy-reducing Van der Waals interactions which promote
desolvation of the biologically active compound within the site, in such a
way that retention of the compound within the cavity is favoured
energetically.

In a further preferred embodiment of the first aspect of the present
invention, the method includes screening for, or designing, a compound
which possesses a stereochemistry and/or geometry which allows it to
interact with both the L1 and L2 domains of the receptor site. It is believed
that EGFR monomers may dimerise in nature in such a manner that the
cavities of each monomer may face each other. Accordingly, the method of
the first aspect of the present invention may involve screening for, or
designing, a biologically active compound which interacts with the L1
domain of one monomer and the L2 domain of the other monomer.

In a further preferred embodiment of the first aspect of the present
invention the compound interacts with a fragment in the region of the L1
domain-S1 domain interface, causing an alteration in the positions of the
domains relative to each other. Preferably, the interaction of the compound
causes the L1 and S1 domains to move away from each other. In a further
preferred embodiment the compound interacts with the hinge region between
the S1 domain and the L2 domain, causing an alteration in the positions of
these domains relative to each other. In a further preferred embodiment the
compound interacts with the β sheet of the L1 domain, causing an alteration
in the position of the L1 domain relative to the position of the S1 domain or
L2 domain.

In a further preferred embodiment, the compound binds to a lower face
(according to orientations shown in Figures 3 and 4) containing the second
β-sheet of the L1 and/or L2 domains, wherein the structure of the face is
characterised by a plurality of solvent-exposed hydrophobic residues.
Examples of these hydrophobic residues include Tyr64, Leu66, Tyr89, Tyr93
(see Figure 7), Leu348, Phe380 and Phe412 (see Figure 10).

In a further preferred embodiment the compound interacts with the
hinge region between the L2 and S2 domains, causing an alteration
in the positions of the L1 and L2 domains relative to each other. Preferably,
the interaction of the compound causes the L1 and L2 domains to move away
from each other.

In a further preferred embodiment the compound interacts with the β
sheet of the L2 domain, causing an alteration in the position of the L2 domain
relative to the position of the L1 domain.

In a further preferred embodiment of the present invention, the
stereochemical complementarity is such that the compound has a Kd for the
receptor site of less than 10'*M. More preferably, the value is less than
lO'^M and more preferably less than IC^M.

In preferred embodiments of the first aspect of the present invention,
the compound is selected or modified from a known compound identified
from a database.

In one embodiment of the first aspect, the compound has the ability to
increase an activity mediated by the molecule of the EGF receptor family.

In another embodiment, the compound has the ability to decrease an
activity mediated by the molecule of the EGF receptor family. Preferably, the
stereochemical interaction between the compound and the receptor site is
adapted to prevent the binding of a natural ligand of the molecule of the EGF
receptor family to the receptor site. Preferably, the compound has a Ki of less
than lO'^M, more preferably less than 10"*M and more preferably less than lO'^M.

In a second aspect the present invention provides a computer-assisted
method for identifying potential compounds able to bind to a molecule of the
EGF receptor family and to modulate an activity mediated by the molecule,
using a programmed computer comprising a processor, an input device, and an
output device, comprising the steps of:

(a) inputting into the programmed computer, through the input device,
data comprising the atomic coordinates of the EGF receptor molecule as shown
in Figure 6, or a subset thereof;

(b) generating, using computer methods, a set of atomic coordinates of a
structure that possesses stereochemical complementarity to the atomic
coordinates of the EGF receptor site as shown in Figure 6, or a subset thereof,
thereby generating a criteria data set;

(c) comparing, using the processor, the criteria data set to a computer
database of chemical structures;

(d) selecting from the database, using computer methods, chemical
structures which are similar to a portion of said criteria data set; and

(e) outputting, to the output device, the selected chemical structures
which are similar to a portion of the criteria data set.
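
Steps (a) to (e) can be pictured with a deliberately simplified sketch. The specification does not prescribe any particular representation or similarity measure, so everything below is an assumption made for illustration: the receptor site is reduced to a handful of typed feature points, the criteria data set is the set of complementary features, and similarity is the fraction of typed pairwise distances in the criteria set that a database entry reproduces within a tolerance. All names, coordinates and database entries are hypothetical.

```python
import math
from itertools import combinations

# Assumed complementarity rule: a donor on the receptor calls for an
# acceptor on the compound, and so on.
COMPLEMENT = {
    "donor": "acceptor",
    "acceptor": "donor",
    "hydrophobe": "hydrophobe",
}


def build_criteria(site_features):
    """Step (b): derive complementary features from the site features.

    For simplicity each complementary feature is placed at the position
    of the site feature it pairs with.
    """
    return [(COMPLEMENT[kind], pos) for kind, pos in site_features]


def pair_signature(features):
    """Describe a feature set by its typed pairwise distances."""
    return [
        (tuple(sorted((k1, k2))), math.dist(p1, p2))
        for (k1, p1), (k2, p2) in combinations(features, 2)
    ]


def similarity(criteria, candidate, tol=1.0):
    """Step (c): fraction of criteria pairs matched by a candidate pair."""
    crit_sig = pair_signature(criteria)
    cand_sig = pair_signature(candidate)
    if not crit_sig:
        return 0.0
    hits = sum(
        any(kinds == ck and abs(d - cd) <= tol for ck, cd in cand_sig)
        for kinds, d in crit_sig
    )
    return hits / len(crit_sig)


def screen(site_features, database, threshold=0.5):
    """Steps (b) to (e): return (name, score) pairs at or above threshold."""
    criteria = build_criteria(site_features)
    scored = [(name, similarity(criteria, feats))
              for name, feats in database.items()]
    selected = [(name, score) for name, score in scored if score >= threshold]
    return sorted(selected, key=lambda item: item[1], reverse=True)


# Step (a): hypothetical site features (NOT taken from Figure 6).
site = [
    ("donor", (0.0, 0.0, 0.0)),
    ("acceptor", (4.0, 0.0, 0.0)),
    ("hydrophobe", (0.0, 3.0, 0.0)),
]

# Hypothetical database of candidate structures.
database = {
    "cmpd_A": [("acceptor", (1.0, 1.0, 1.0)),
               ("donor", (5.0, 1.0, 1.0)),
               ("hydrophobe", (1.0, 4.0, 1.0))],
    "cmpd_B": [("acceptor", (0.0, 0.0, 0.0)),
               ("donor", (9.0, 0.0, 0.0))],
}

hits = screen(site, database)
```

Comparing distances rather than raw coordinates makes the score independent of how each database entry happens to be positioned and oriented. A practical implementation would use a far richer description of shape and chemistry than this sketch.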

In a preferred embodiment of the second aspect, the method is used to
identify potential compounds which have the ability to decrease an activity
mediated by the receptor.

In a further preferred embodiment of the second aspect, the method
further comprises the step of selecting one or more chemical structures from
step (e) which interact with the receptor site of the molecule in a manner which
prevents the binding of natural ligands to the receptor site.

In a further preferred embodiment of the second aspect, the method
further comprises the step of obtaining a compound with a chemical structure
selected in steps (d) and (e), and testing the compound for the ability to decrease
an activity mediated by the receptor.

In a further preferred embodiment of the second aspect, the method is
used to identify potential compounds which have the ability to increase an
activity mediated by the receptor molecule.

In a further preferred embodiment of the second aspect, the method
further comprises the step of obtaining a molecule with a chemical structure
selected in steps (d) and (e), and testing the compound for the ability to increase
an activity mediated by the receptor molecule.

The present invention also provides a method of screening a putative
compound having the ability to modulate the activity of a molecule of the EGF
receptor family, comprising the steps of identifying a putative compound by a
method according to the first or second aspects, and testing the compound for
the ability to increase or decrease an activity mediated by the molecule. In one
embodiment, the test is carried out in vitro. Preferably, the in vitro test is a high
throughput assay. In another embodiment, the test is carried out in vivo.

In a third aspect the present invention provides a compound able to bind
to a molecule of the EGF receptor family and to modulate an activity mediated
by the molecule, the compound being obtained by a method according to the
present invention.

In a preferred embodiment of the third aspect, the compound is a
mutant ligand of a molecule of the EGF receptor family, where at least one
mutation occurs in the region of the ligand which interacts with residues on
the surface of the receptor site facing toward the cavity. For example, the
residues Arg 41 and Tyr 13 in EGF are conserved in other members of the
EGF receptor family of ligands (a Phe residue may be substituted for Tyr 13).
Structures of several EGF family members show the two residues to be in
close proximity (Groenen, L.C., Nice, E.C., Burgess, A.W., 1994, Growth
Factors 11:235-257). This portion of EGF may interact with a hydrophobic
portion of the EGF receptor which contains one or more negatively charged
residues such as the lower β sheet of the L1 domain. Mutants of EGF which
show altered activity may be generated by introducing modifications to Arg
41 or Tyr 13 or other nearby residues. Alternatively, mutants of EGF may be
generated by introducing modifications to residues on the opposite side of
the ligand which may interact with a second receptor molecule in the
unmodified ligand.

In a fourth aspect the present invention provides a compound which
possesses stereochemical complementarity to a topographic region of a
molecule of the EGF receptor family and modulates an activity mediated by
the molecule, wherein the molecule is characterised by

(i) amino acids 1-621 of the EGF receptor positioned at atomic
coordinates substantially as shown in Figure 6;

(ii) one or more subsets of said amino acids related to the coordinates
shown in Figure 6 by whole body translations and/or rotations; or

(iii) amino acids present in the amino acid sequence of a member of
the EGF receptor family, which form an equivalent three-dimensional
structure to that of the receptor site defined by amino acids 1-621 of the EGF
receptor positioned at atomic coordinates substantially as shown in Figure 6;

with the proviso that the compound is not a naturally occurring ligand
of a molecule of the EGF receptor family or a mutant thereof.

By "mutant" we mean a ligand which has been modified by one or
more point mutations, insertions of amino acids or deletions of amino acids.

In a preferred embodiment of the fourth aspect, the topographic region
of the molecule is defined by amino acids 1-475 of the EGF receptor or an
amino acid sequence which forms an equivalent three-dimensional structure
to that of the region defined by amino acids 1-475 of the EGF receptor
positioned at atomic coordinates substantially as shown in Figure 6.

In a further preferred embodiment of the fourth aspect, the topographic
region of the molecule is defined by amino acids 313-621 of the EGF receptor
or an amino acid sequence which forms an equivalent three-dimensional
structure to that of the region defined by amino acids 313-621 of the EGF
receptor positioned at atomic coordinates substantially as shown in Figure 6.

In preferred embodiments of the third and fourth aspects, the
stereochemical complementarity between the compound and the receptor
site is such that the compound has a Kd for the receptor site of less than
10"*M, more preferably less than 10'*M.

In some embodiments of the third and fourth aspects, the compound
increases an activity mediated by the EGF receptor.

In other embodiments of the third and fourth aspects, the compound
decreases an activity mediated by the EGF receptor.

In a fifth aspect, the present invention provides a pharmaceutical
composition for preventing or treating a disease which would benefit from
increased signalling by a molecule of the EGF receptor family, which
comprises a compound according to the third or fourth aspects of the present
invention and a pharmaceutically acceptable carrier or diluent.

In a sixth aspect, the present invention provides a pharmaceutical
composition for preventing or treating a disease associated with signalling by
a molecule of the EGF receptor family which comprises a compound
according to the third or fourth aspects of the present invention and a
pharmaceutically acceptable carrier or diluent.

In a seventh aspect the present invention provides a method of
preventing or treating a disease which would benefit from increased
signalling by a molecule of the EGF receptor family which method comprises
administering to a subject in need thereof a compound according to the third
or fourth aspects of the present invention. Preferably, the disease is selected
from wound healing and gastric ulcers.

In an eighth aspect the present invention provides a method of
preventing or treating a disease associated with signalling by a molecule of
the EGF receptor family which method comprises administering to a subject
in need thereof a compound according to the third or fourth aspects of the
present invention. Preferably, the disease is selected from psoriasis and
tumour states comprising but not restricted to cancer of the breast, brain,
ovary, cervix, pancreas, lung, head and neck, and melanoma,
rhabdomyosarcoma, mesothelioma and glioblastoma.

Throughout this specification, the word "comprise", or variations such
as "comprises" or "comprising", will be understood to imply the inclusion of a
stated element, integer or step, or group of elements, integers or steps, but
not the exclusion of any other element, integer or step, or group of elements,
integers or steps.

Brief Description of the Drawings 

Figure 1: Sequence alignment of human EGF receptor family proteins with
IGF-1 receptor sequences and insulin receptor sequence for the first two
domains of the EGF receptor. The alignment of the EGF receptor and the
various IGF-1 receptor sequences was used by the MODELLER program to
create a model of the EGF receptor domains L1 and S1. Residues which are
underlined were used to create additional Cα-Cα restraints for the
construction of the EGF receptor model. Disulfide bonds are also indicated by
lines between cysteine residues. The modules of the EGF receptor S1 domain
are numbered.

Figure 2: Sequence alignment of human EGF receptor family proteins with
IGF-1 receptor sequences and insulin receptor sequence for the third and
fourth domains of the EGF receptor. Additional labels and lines are similar to
those in Figure 1.

Figure 3: Model polypeptide fold of the L1 and S1 domains of the EGF
receptor. The L1 is at the left hand side of the structure with the N-terminus
facing the front. Cysteine residue sidechains are depicted as sticks.

Figure 4: Model polypeptide fold of the L2 and S2 domains of the EGF
receptor. The L2 is at the bottom of the structure with the N-terminus facing
the front. Cysteine residue sidechains are depicted as sticks.

Figure 5: Superposition of the two models (of the L1 and S1 domains and of
the L2 and S2 domains) onto the structure of the first three domains of the IGF-1
receptor. Cysteine residue sidechains are depicted as sticks. Selected
residues are shown as spheres and labelled.

Figure 6: Coordinates of the two models of the EGF receptor extracellular
domain. The first model consists of the domains L1 and S1. The second
model consists of the domains L2 and S2. The coordinates are in relation to a
Cartesian set of orthogonal axes. The L1, S1 and L2 domains of the EGF
receptor models have been superimposed on the crystal structure of the IGF-1
receptor domains L1, cysteine-rich domain and L2. The final column
contains the number 20, 40 or 60, depending on whether the residue
containing the atom is judged to be well-modelled, have a moderate
possibility of error, or is likely to be inaccurate, respectively.
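
The reliability code in the final column of Figure 6 lends itself to simple filtering before any design work. The sketch below assumes, purely for illustration, a whitespace-separated record of atom name, residue name, residue number, x, y, z and the final-column code; the actual column layout of Figure 6 is not reproduced here, and the sample records are invented.

```python
# Meaning of the final-column codes as described for Figure 6.
RELIABILITY = {
    20: "well-modelled",
    40: "moderate possibility of error",
    60: "likely to be inaccurate",
}


def parse_record(line):
    """Parse one record in the assumed (hypothetical) column layout."""
    atom, res_name, res_num, x, y, z, flag = line.split()
    code = int(float(flag))
    if code not in RELIABILITY:
        raise ValueError(f"unexpected reliability code: {flag}")
    return {
        "atom": atom,
        "residue": res_name,
        "number": int(res_num),
        "xyz": (float(x), float(y), float(z)),
        "code": code,
    }


def select_reliable(lines, worst_allowed=20):
    """Keep atoms whose reliability code is no worse than worst_allowed."""
    records = [parse_record(line) for line in lines if line.strip()]
    return [r for r in records if r["code"] <= worst_allowed]


# Invented sample records (NOT taken from Figure 6).
sample = [
    "CA ALA 1 12.345 -3.210 7.890 20",
    "CA GLY 2 15.002 -1.118 9.431 40",
    "CA SER 3 40.771 22.090 -6.554 60",
]

core = select_reliable(sample)                      # code 20 only
relaxed = select_reliable(sample, worst_allowed=40)  # codes 20 and 40
```

Restricting an analysis to atoms coded 20, or to those coded 20 and 40, is one way to avoid building on the regions of the model that are flagged as likely to be inaccurate.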

Figure 7: Part of the model polypeptide fold of the Ll and Si domains of 
the EGF receptor. Side chains of residues from the Ll domain which face 
towards the large cavity (shown in Figure 5) are shown in ball and stick 
notation and labelled with residue number and the one letter code. 

Figure 8: Part of the model polypeptide fold of the L1 and S1 domains of
the EGF receptor. Side chains of residues from the S1 domain which face
towards the large cavity (shown in Figure 5) are shown in ball and stick
notation and labelled using the one letter code.

Figure 9: Part of the model polypeptide fold of the L2 and S2 domains of
the EGF receptor. Side chains of residues from the L2 domain which face
towards the large cavity (shown in Figure 5) are shown in ball and stick
notation and labelled using the one letter code.

Figure 10: Part of the model polypeptide fold of the L2 and S2 domains of
the EGF receptor. Solvent exposed residues from the face of the L2 domain
containing the large β sheet are shown in ball and stick representation.

Detailed description of Preferred Embodiments of the Invention

The present inventors have developed three dimensional structural
information about the EGF receptor to enable a more accurate understanding
of how the binding of ligand leads to signal transduction. Such information
provides a rational basis for the development of ligands for specific
therapeutic applications, something that heretofore could not have been
predicted de novo from available sequence data.

The precise mechanisms underlying the binding of agonists and
antagonists to the EGF receptor are not fully clarified. However, the binding
of ligands to the receptor site, preferably with an affinity in the order of
IC^M or higher, is understood to arise from enhanced stereochemical
complementarity relative to naturally occurring EGF receptor ligands.

Such stereochemical complementarity, pursuant to the present
invention, is characteristic of a molecule that matches intra-site surface
residues lining the groove of the receptor site as enumerated by the
coordinates set out in Figure 6. The residues lining the groove are depicted
in Figures 7, 8 and 9. By "match" we mean that the identified portions
interact with the surface residues, for example, via hydrogen bonding or by
enthalpy-reducing Van der Waals interactions which promote desolvation of
the biologically active compound within the site, in such a way that retention
of the biologically active compound within the groove is favoured
energetically.

Substances which are complementary to the shape of the receptor site
characterised by amino acids positioned at atomic coordinates set out in
Figure 6 may be able to bind to the receptor site and, when the binding is
sufficiently strong, substantially prohibit binding of the naturally occurring
ligands to the site.

It will be appreciated that it is not necessary that the complementarity
between ligands and the receptor site extend over all residues lining the
groove in order to inhibit binding of the natural ligand. Accordingly,
agonists or antagonists which bind to a portion of the residues lining the
groove are encompassed by the present invention.

In general, the design of a molecule possessing stereochemical
complementarity can be accomplished by means of techniques that optimize,
either chemically or geometrically, the "fit" between a molecule and a target
receptor. Known techniques of this sort are reviewed by Sheridan and
Venkataraghavan, Acc. Chem. Res. 1987 20 322; Goodford, J. Med. Chem.
1984 27 557; Beddell, Chem. Soc. Reviews 1985, 279; Hol, Angew. Chem.
1986 25 767 and Verlinde C.L.M.J. & Hol, W.G.J. Structure 1994, 2, 577, the
respective contents of which are hereby incorporated by reference. See also
Blundell et al., Nature 1987 326 347 (drug development based on information
regarding receptor structure).

Thus, there are two preferred approaches to designing a molecule, 
according to the present invention, that complements the shape of the EGF 
receptor. By the geometric approach, the number of internal degrees of 
freedom (and the corresponding local minima in the molecular conformation 
space) is reduced by considering only the geometric (hard-sphere) 
interactions of two rigid bodies, where one body (the active site) contains 
"pockets" or "grooves" that form binding sites for the second body (the 
complementing molecule, as ligand). The second preferred approach entails 
an assessment of the interaction of respective chemical groups ("probes") 
with the active site at sample positions within and around the site, resulting 
in an array of energy values from which three-dimensional contour surfaces 
at selected energy levels can be generated. 

The geometric approach is illustrated by Kuntz et al., J. Mol. Biol. 1982
161 269, the contents of which are hereby incorporated by reference, whose
algorithm for ligand design is implemented in a commercial software package
distributed by the Regents of the University of California and further
described in a document, provided by the distributor, which is entitled
"Overview of the DOCK Package, Version 1.0", the contents of which are
hereby incorporated by reference. Pursuant to the Kuntz algorithm, the
shape of the cavity represented by the EGF receptor site is defined as a series
of overlapping spheres of different radii. One or more extant databases of
crystallographic data, such as the Cambridge Structural Database System
maintained by Cambridge University (University Chemical Laboratory,
Lensfield Road, Cambridge CB2 1EW, U.K.) and the Protein Data Bank
maintained by Brookhaven National Laboratory (Chemistry Dept., Upton, NY
11973, U.S.A.), is then searched for molecules which approximate the shape
thus defined.
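
By way of illustration only, the idea of describing a cavity as a set of
overlapping spheres and asking how well a candidate molecule fills it might
be sketched as follows. This is not the DOCK algorithm itself, which matches
sphere centres to ligand atoms; it is a simplified shape-fit score. The
sphere centres, radii and the function names are hypothetical and are not
taken from Figure 6.

```python
import math

# Hypothetical cavity description: (x, y, z, radius), all in angstroms.
# These values are illustrative only and are not derived from Figure 6.
CAVITY_SPHERES = [
    (0.0, 0.0, 0.0, 3.0),
    (2.5, 0.0, 0.0, 2.0),
    (4.5, 0.5, 0.0, 2.5),
]


def inside_cavity(point, spheres):
    """Return True if the point lies inside at least one sphere."""
    return any(math.dist(point, s[:3]) <= s[3] for s in spheres)


def fraction_inside(atoms, spheres):
    """Fraction of atom centres that fall within the sphere set."""
    if not atoms:
        return 0.0
    hits = sum(inside_cavity(atom, spheres) for atom in atoms)
    return hits / len(atoms)
```

A candidate whose atom centres mostly fall inside the spheres scores close
to 1.0, and such candidates would then be examined further for chemical
complementarity.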

Molecules identified in this way, on the basis of geometric parameters,
can then be modified to satisfy criteria associated with chemical
complementarity, such as hydrogen bonding, ionic interactions and Van der
Waals interactions.

The chemical-probe approach to ligand design is described, for
example, by Goodford, J. Med. Chem. 1985 28 849, the contents of which are
hereby incorporated by reference, and is implemented in several commercial
software packages, such as GRID (product of Molecular Discovery Ltd., West
Way House, Elms Parade, Oxford OX2 9LL, U.K.). Pursuant to this approach,
the chemical prerequisites for a site-complementing molecule are identified
at the outset, by probing the active site (as represented via the atomic
coordinates shown in Fig. 1) with different chemical probes, e.g., water, a
methyl group, an amine nitrogen, a carboxyl oxygen, and a hydroxyl.
Favored sites for interaction between the active site and each probe are thus
determined, and from the resulting three-dimensional pattern of such sites a
putative complementary molecule can be generated.
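
The probing step can be pictured with a minimal sketch, given here under
stated assumptions. A single probe is moved over a regular grid and an
interaction energy is recorded at each point. The energy function below is a
plain 12-6 Lennard-Jones term with illustrative parameters; the GRID program
uses its own empirical energy function, which is not reproduced here. All
names and numeric values are hypothetical.

```python
import math


def lennard_jones(r, epsilon=0.2, r_min=3.5):
    """12-6 Lennard-Jones energy; epsilon and r_min are illustrative values."""
    ratio = r_min / r
    return epsilon * (ratio ** 12 - 2.0 * ratio ** 6)


def probe_energy(position, site_atoms):
    """Sum the probe energy over all site atoms (distances clamped at 0.5)."""
    return sum(
        lennard_jones(max(math.dist(position, atom), 0.5)) for atom in site_atoms
    )


def energy_grid(site_atoms, origin, spacing, shape):
    """Map each grid index (i, j, k) to the probe energy at that point."""
    grid = {}
    for i in range(shape[0]):
        for j in range(shape[1]):
            for k in range(shape[2]):
                point = (
                    origin[0] + i * spacing,
                    origin[1] + j * spacing,
                    origin[2] + k * spacing,
                )
                grid[(i, j, k)] = probe_energy(point, site_atoms)
    return grid


def favoured_points(grid, threshold):
    """Grid indices whose energy is at or below the chosen contour level."""
    return sorted(index for index, energy in grid.items() if energy <= threshold)
```

Repeating the scan with a different energy function for each probe type
would give one array of energy values per probe, from which contour surfaces
at selected energy levels could be drawn.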

Programs suitable for searching three-dimensional databases to identify
molecules bearing a desired pharmacophore include: MACCS-3D and ISIS/3D
(Molecular Design Ltd., San Leandro, CA), ChemDBS-3D (Chemical Design
Ltd., Oxford, U.K.), and Sybyl/3DB Unity (Tripos Associates, St Louis, MO).

Programs suitable for pharmacophore selection and design include:
DISCO (Abbott Laboratories, Abbott Park, IL), Catalyst (Bio-CAD Corp.,
Mountain View, CA), and ChemDBS-3D (Chemical Design Ltd., Oxford,
U.K.).

Databases of chemical structures are available from a number of
sources including Cambridge Crystallographic Data Centre (Cambridge, U.K.)
and Chemical Abstracts Service (Columbus, OH).

De novo design programs include Ludi (Biosym Technologies Inc., San
Diego, CA), Sybyl (Tripos Associates) and Aladdin (Daylight Chemical
Information Systems, Irvine, CA).

Those skilled in the art will recognize that the design of a mimetic may
require slight structural alteration or adjustment of a chemical structure
designed or identified using the methods of the invention.

The invention may be implemented in hardware or software, or a
combination of both. However, preferably, the invention is implemented in
computer programs executing on programmable computers each comprising
a processor, a data storage system (including volatile and non-volatile
memory and/or storage elements), at least one input device, and at least one
output device. Program code is applied to input data to perform the
functions described above and generate output information. The output
information is applied to one or more output devices, in known fashion. The
computer may be, for example, a personal computer, microcomputer, or
workstation of conventional design.

Each program is preferably implemented in a high level procedural or
object-oriented programming language to communicate with a computer
system. However, the programs can be implemented in assembly or machine
language, if desired. In any case, the language may be a compiled or
interpreted language.

Each such computer program is preferably stored on a storage medium
or device (e.g., ROM or magnetic diskette) readable by a general or special
purpose programmable computer, for configuring and operating the computer
when the storage media or device is read by the computer to perform the
procedures described herein. The inventive system may also be considered
to be implemented as a computer-readable storage medium, configured with
a computer program, where the storage medium so configured causes a
computer to operate in a specific and predefined manner to perform the
functions described herein.

Compounds designed according to the methods of the present
invention may be assessed by a number of in vitro and in vivo assays of
hormone function. For example, the identification of EGF receptor
antagonists may be undertaken using a solid-phase receptor binding assay.
Potential antagonists may be screened for their ability to inhibit the binding
of europium-labelled EGF receptor ligands to soluble, recombinant EGF
receptor in a microplate-based format. Europium is a lanthanide fluorophore,
the presence of which can be measured using time-resolved fluorometry. The
sensitivity of this assay matches that achieved by radioisotopes,
measurement is rapid and is performed in a microplate format to allow high
sample throughput, and the approach is gaining wide acceptance as the
method of choice in the development of screens for receptor
agonists/antagonists (see Apell et al. J. Biomolec. Screening 3:19-27, 1998;
Inglese et al. Biochemistry 37:2372-2377, 1998).

Binding affinity and inhibitor potency may be measured for candidate 
inhibitors using biosensor technology. 

The EGF receptor antagonists may be tested for their ability to
modulate receptor activity using a cell-based assay incorporating a stably
transfected, EGF-responsive reporter gene (Souriau, C., Fort, P., Roux, P.,
Hartley, O., Lefranc, M-P., Weill, M., 1997, Nucleic Acids Res. 25:1585-1590).
The assay addresses the ability of EGF to activate the reporter gene in the
presence of novel ligands. It offers a rapid (results within 6-8 hours of
hormone exposure), high-throughput (assay can be conducted in a 96-well
format for automated counting) analysis using an extremely sensitive
detection system (chemiluminescence). Once candidate compounds have
been identified, their ability to antagonise signal transduction via the EGF-R
can be assessed using a number of routine in vitro cellular assays such as
inhibition of EGF-mediated cell proliferation. Ultimately, the efficiency of
an antagonist as a tumour therapeutic may be tested in vivo in animals bearing
tumour isografts and xenografts as described (Rockwell, P., O'Connor, W.J.,
King, K., Goldstein, N.I., Zhang, L.M., Stein, C.A., 1997, Proc Natl Acad Sci U
S A 94:6523-6528; Prewett, M., Rothman, M., Waksal, R., Feldman, M.,
Bander, N.H., Hicklin, D.J., 1998, Clin Cancer Res 4:2957-2966).

Tumour growth inhibition assays may be designed around a nude
mouse xenograft model using a range of cell lines. The effects of the receptor
antagonists and inhibitors may be tested on the growth of subcutaneous
tumours.

Comparative modelling

The comparative modelling method exploits the observation that
proteins with more than 25% amino acid identity will almost always have a
similar protein backbone (Sander, C. and Schneider, R., 1991, Proteins:
Structure Function and Genetics, 9, 56-68). In some cases, proteins will have
similar backbone structures with a lower proportion of identical amino acids.
By aligning the sequence of a (target) protein which is to be modelled with
the sequences of proteins with known structures (the templates), a model of
the protein can be obtained. Where a region of the target sequence follows
the sequence of a template, the backbone of the target is built to follow that
of the template. Where the target sequence cannot be aligned to a template
sequence, the so-called insertion must be constructed by other means
(Greer, J., 1991, Meth. Enzym. pp 239-252).
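
As a minimal sketch of the identity criterion, the percentage of identical
residues can be computed directly from a pairwise alignment. The convention
used below (identical positions divided by the number of columns in which
neither sequence has a gap) is an assumption, as the text does not state
which denominator is intended, and the function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of identical residues over columns without a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap
    ]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


def likely_similar_backbone(identity, threshold=25.0):
    """Apply the 25% rule of thumb quoted in the text."""
    return identity > threshold
```

For the toy alignment `ACDE-F` against `ACDKGF`, five columns are free of
gaps and four of them match, giving 80% identity.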

The MODELLER program (Sali, A. and Blundell, T.L., 1993, J. Mol. Biol.
234, 779-815) is a semi-automated approach to building models of proteins
given the structures of one or more templates and an alignment
between the sequences of the target protein and the templates. Based on the
sequence alignment and a set of rules derived from the analysis of sets of
aligned structures, the program generates a series of restraints for variables
such as Cα-Cα distances and main chain and side chain dihedral angles for
the target structure. The restraints are expressed in terms of probability
density functions (PDFs). The PDFs are combined to yield an expression for
the most probable structure as a function of the variables (Cα-Cα distances
etc). The program then attempts to find structures to maximise the value of
this function. In effect, the program attempts to minimise a transformed
version of this function.
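
The relationship between maximising a combined PDF and minimising a
transformed function can be illustrated with a deliberately small example.
The sketch below assumes Gaussian PDFs on a single variable, combines them
by multiplication, and minimises the negative logarithm of the product by a
simple grid scan. The actual restraint forms and optimiser used by MODELLER
are more elaborate and are not reproduced here; all names and values are
hypothetical.

```python
import math


def gaussian_pdf(x, mean, sigma):
    """Normalised Gaussian probability density."""
    z = (x - mean) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def objective(x, restraints):
    """Negative log of the product of the restraint PDFs."""
    return -sum(math.log(gaussian_pdf(x, mean, sigma)) for mean, sigma in restraints)


def minimise_1d(restraints, lo, hi, steps=10001):
    """Grid scan for the x in [lo, hi] with the lowest objective."""
    best_x, best_f = lo, objective(lo, restraints)
    for n in range(1, steps):
        x = lo + (hi - lo) * n / (steps - 1)
        f = objective(x, restraints)
        if f < best_f:
            best_x, best_f = x, f
    return best_x
```

With two restraints on one distance, for example (3.8, 0.1) and (4.2, 0.2)
as (mean, sigma) pairs, the minimum lies near 3.88, closer to the tighter
restraint, which is the same point at which the product of the two PDFs is
largest.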

While some comparative modelling approaches involve the explicit
building of regions of the model for which there is no sequence alignment
with a template, the MODELLER program constructs PDFs for these regions,
thus including them in the consideration of constructing a comparative
model. It is conceivable that, once a comparative model has been constructed
using MODELLER, an algorithm to build the structures of these regions
is then applied.

The MODELLER program was used to build the structures of the 
extracellular portion of the EGF receptor using the 3D structure of the IGF-1 
receptor (as described in PCT/AU98/00998) as a template. The description of 
the generation of these models is outlined below. 

Construction of the alignment

The region of the IGF-1 receptor whose structure is known (Garrett,
T.P., McKern, N.M., Lou, M., Frenkel, M.J., Bentley, J.D., Lovrecz, G.O.,
Elleman, T.C., Cosgrove, L.J., Ward, C.W., 1998, Nature 394:395-399) consists
of three domains, the L1 domain, cysteine-rich domain (CRD) and the L2
domain (in order of increasing residue number). The L1 and L2 domains
adopt similar folds, each consisting of a single-stranded right-handed β-helix.
The helix contains three β-sheets which make up the left and right sides and
the bottom of the β-helix. The top is less regular. This type of β-helix has
been dubbed a "breadloaf". The cysteine-rich domain (CRD) consists of eight
small modules, each of which has one or two disulfide bonds. The first three
modules of the CRD contain two disulfide bonds which have a Cys1-Cys3
and Cys2-Cys4 disulfide pairing arrangement. The next four have a single
disulfide bond with a so-called β-finger structure. The eighth module of the
CRD contains one disulfide bond but is not a β-finger.

The sequence of the EGF receptor extracellular domain can be divided
into four domains, L1, S1, L2 and S2 (in order of increasing residue number)
on the basis of internal homology and homology with the insulin receptor
family (Ward, C.W., Hoyne, P.A., Flegg, R.H., 1995, Proteins 22:141-153;
Bajaj, M., Waterfield, M.D., Schlessinger, J., Taylor, W.R., Blundell, T., 1987,
Biochim Biophys Acta 916:220-226). The L1 and L2 domains are similar in
sequence to each other and to the L1 and L2 domains in the IGF-1 receptor.
The S1 and S2 domains are similar in sequence and also similar to the CRD
of the IGF-1 receptor. These three domains contain a large number of
cysteine residues, although the S2 domain of the EGF receptor has two fewer
cysteine residues than does the CRD of the IGF-1 receptor and the S1 domain
of the EGF receptor.

Two important sequence motifs are found in the EGF receptor
sequence which are conserved in other EGF receptor homologues. The first
motif is the sequence CXXXXXXW, which is found near the end of the
sequences of the L1 and L2 domains of the EGF receptor and its homologues,
where C is cysteine and W is tryptophan. (The motif in the L1 domain of the
EGF receptor consists of C133-W140 and in the L2 domain consists of
C446-W453.) The second motif is the sequence CW, which occurs near the
start of the S1 and S2 domains of the EGF receptor (C175-W176 in the S1
domain and C491-W492 in the S2 domain). The two motifs also occur in the
insulin receptor family (C120XXXXXXW127 and C175W176 in IGF-1 receptor)
in the L1 domain and cysteine-rich domain respectively. In contrast to the
EGF receptor and its homologues, the first of these two motifs does not occur
in the L2 domain of the insulin receptor family. Structurally, the first motif
corresponds to part of the L1 domain which allows penetration of the
tryptophan residue of the second motif into the β-helix. As the first sequence
motif is absent from the L2 domain of the IGF-1 receptor, very little of the
structure of this domain was used as a template in the modelling of the EGF
receptor.
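
Locating the two motifs in a sequence is a simple pattern search, sketched
below under stated assumptions. The motif CXXXXXXW is read as a cysteine,
any six residues, then a tryptophan (consistent with C133-W140 spanning
eight positions). The function name and the toy sequence in the usage note
are hypothetical; the sequence is not that of the EGF receptor.

```python
import re

# Lookahead patterns so that overlapping occurrences are all reported.
MOTIF_1 = re.compile(r"(?=(C.{6}W))")  # C, any six residues, W
MOTIF_2 = re.compile(r"(?=(CW))")      # C immediately followed by W


def find_motif(pattern, sequence, first_residue_number=1):
    """Return (residue number of the cysteine, matched text) for each hit."""
    return [
        (match.start() + first_residue_number, match.group(1))
        for match in pattern.finditer(sequence)
    ]
```

For the toy sequence `AAACGGGGGGWAACWA`, the first motif is reported at
residue 4 and the second at residue 14. The `first_residue_number` argument
allows numbering to follow the mature protein rather than the string index.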

Construction of the alignment of L1 and S1

As the L1 domain of the IGF-1 receptor has a defined core, the
sequence alignment was manually constructed with a view to placing most of
the conserved hydrophobic residues of the EGF receptor such that their side
chains point towards the β-helical core. Homologues of the EGF receptor
were included in the alignment to assist with the identification of such
residues (Figure 1). Other IGF-1 receptor residues whose positions were
conserved were the four cysteine residues in the L1 domain and the residues
Arg 77, Trp 127, Trp 176 and Gln 182. Two small regions of the IGF-1
receptor were also included in the alignment. The first of these regions
includes the sequence Ser 375 - Lys 380 from the L2 domain of the IGF-1
receptor and is used as a template for modelling the EGF receptor residues
Asp 51 - Lys 56. Additional flanking residues were also used. Residues Ile
385 - Phe 397 of the IGF-1 receptor were also used as a template to better
model the EGF receptor residues Ile 83 - Leu 95 (Figure 1).

The alignment of the S1 domain of the EGF receptor to the cysteine-rich
domain of the IGF-1 receptor used the same combination of modules. All
of the putative modules of the EGF receptor S1 domain were aligned to part
or all of the corresponding module of the CRD of the IGF-1 receptor. The
third module of the IGF-1 receptor CRD (Cys 201 - Cys 218) was used as an
additional template for the first (Cys 166 - Cys 183) and second (Cys 191 -
Cys 207) putative modules of the EGF receptor S1 domain. The residues
Cys 230 - Cys 246 of IGF-1 receptor, which include the protein's fifth module,
were aligned to the EGF receptor residues Cys 267 - Cys 283 (which include
the EGF receptor S1 domain's putative sixth module).

Construction of the alignment of L2 and S2

Construction of the alignment of the sequence of the L2 domain of the
EGF receptor to the sequence of the L1 domain of the IGF-1 receptor followed
similar principles to that of the alignment of the L1 domain of the EGF
receptor. The region Ile 385 - Phe 397 of the IGF-1 receptor served as an
additional template and its sequence was aligned to Ile 402 - Leu 414 of the
EGF receptor (Figure 2).

An analysis of β-finger modules in the IGF-1 receptor, TNF receptor
and the laminin-γ structures revealed that these modules could be classified
into three types exhibiting some structural and sequence conservation. Two
of the structural types are relevant to the IGF-1 receptor and the EGF
receptor. The first type of β-finger is characterised by structural conservation
of the C-terminal part of the module and also of the linker region after the
module. The signature sequence is C...CXXC where the third cysteine
residue is the start of another β-finger module. The second type of β-finger is
characterised by structural conservation of the N-terminal portion of the
module and also of the linker region after the module. The signature
sequence is C...CXXXC where the third cysteine is the start of a module
whose disulfide bonding pattern has a Cys 1-Cys 3, Cys 2-Cys 4 arrangement.

Comparison of the sequences of the modules of the IGF-1 receptor CRD
with the sequence of the EGF receptor S2 domain suggested that the
arrangement of modules in the S2 domain was different from those of the
IGF-1 receptor CRD and the EGF receptor S1 domain. The residues of the third
module in the CRD of the IGF-1 receptor, Cys 201 - Cys 218, could be aligned
with the segments of the EGF receptor S2 domain sequence: Cys 482 - Cys
499; Cys 534 - Cys 555 and Cys 596 - Cys 612. These modules are the putative
first, fourth and seventh modules of the S2 domain. The residues of the first
EGF receptor module were also aligned to residues Cys 152 - Cys 181 of the
first module of the IGF-1 receptor CRD. The residues of the fourth module in
the CRD of the IGF-1 receptor, Cys 221 - Cys 230, a β-finger module of the
first type described above, could be aligned with the regions of sequence Cys
502 - Cys 511 and Cys 558 - Cys 567. These two regions of the EGF receptor
S2 domain are the putative second and fifth modules. By elimination, the
regions between the two sets of remaining cysteine residues (the putative
third and sixth modules) were assigned as β-finger modules of the second
type. These regions of sequence are followed by three residues and then a
module containing four cysteine residues. The N-terminal regions of the fifth
(Cys 234 - Cys 246) and seventh modules (Cys 277 - Cys 291) of the IGF-1
receptor CRD were both aligned to the N-terminal regions of the two modules
(Cys 515 - Cys 531 and Cys 571 - Cys 593).

In the IGF-1 receptor CRD, there is no occurrence of a β-finger module
being followed by a module containing four cysteine residues. Thus, the
positioning of the fourth module in the EGF receptor S2 model relative to the
third module is essentially arbitrary. The same applies to the positioning of
the seventh module relative to the sixth module of the EGF receptor S2
domain model.

Construction of the model

Version 3 of the MODELLER program (Modeler User Guide, October
1996, San Diego Molecular Simulations Inc) was used to build models of the
EGF receptor. The various sequences of the IGF-1 receptor and the EGF
receptor shown in Figure 1 were used as the alignment for the construction of
the model of the L1 and S1 domains of the EGF receptor. The coordinates of
each of the IGF-1 receptor sequences (i.e. the templates) shown in Figure 1
were also used as input for the MODELLER program. Additional distance
restraints were generated between Cα atoms of selected residues. The
restraints were generated as follows. The small IGF-1 receptor templates
were superimposed onto the structure of the first two domains of the IGF-1
receptor using the Cα atoms of the residues which are aligned in Figure 1.
Using the Homology module of the Insight II program (Homology User Guide,
October 1995, San Diego BIOSYM/MSI), coordinates were built for the EGF
receptor residues which are aligned to the IGF-1 receptor coordinates which
are in bold typeface. From these coordinates, distance restraints in the form
of Gaussian curves were constructed for pairs of Cα atoms with a distance
less than 50 Å. The sigma value of the Gaussian curves was set to be 2 Å. A
MODELLER run was submitted using the alignment in Figure 1. The built
models of proteins attempt to satisfy these restraints in addition to the
restraints the program derives from the alignment.
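
The generation of additional distance restraints described above can be
sketched as follows. The sketch assumes that each restraint is a Gaussian
centred on the Cα-Cα distance measured in the built coordinates, and that
pairs are kept only when that distance is below a cutoff. The cutoff and
sigma are left as parameters, and the function names and data layout are
hypothetical; this is not the input format used by MODELLER.

```python
import math
from itertools import combinations


def ca_distance_restraints(ca_coords, cutoff, sigma):
    """Build (res_i, res_j, mean_distance, sigma) for close C-alpha pairs.

    ca_coords maps residue number to an (x, y, z) tuple in angstroms.
    """
    restraints = []
    for (i, pos_i), (j, pos_j) in combinations(sorted(ca_coords.items()), 2):
        distance = math.dist(pos_i, pos_j)
        if distance < cutoff:
            restraints.append((i, j, distance, sigma))
    return restraints


def restraint_penalty(distance, mean, sigma):
    """Negative log of an unnormalised Gaussian restraint."""
    return 0.5 * ((distance - mean) / sigma) ** 2
```

A model in which a restrained pair sits exactly at its mean distance incurs
no penalty, and the penalty grows quadratically as the pair moves away in
units of sigma.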

The aligned IGF-1 receptor and EGF receptor sequences of Figure 2
were used as the alignment for creating the model of the L2 and S2 domains
of the EGF receptor. The coordinates of each of the IGF-1 receptor
sequences shown in Figure 2 were used as the structural templates. Two
separate sets of additional restraints were used. The first set was based on
the underlined IGF-1 receptor residues which are aligned to EGF receptor
residues Cys 482 - Cys 534 (the first module of the S2 domain to the first
cysteine of the fourth module). From the coordinates of the Cα atoms of these
residues, distance restraints in the form of Gaussian curves were constructed
for pairs of Cα atoms with a distance less than 50 Å. The second set of
additional restraints was based on the Cα atoms of the underlined IGF-1
residues which are aligned to EGF receptor residues Cys 534 - Cys 596 (the
fourth module of the S2 domain to the first cysteine of the seventh module).
The sigma value of the Gaussian curve used to construct the additional
restraints was 1 Å.

For both sets of models, the MODELLER program constructed 20
models whose coordinates were perturbed from an initial structure by a
random value of maximum distance 4 Å. The refinement level used was the
'refine1' option in the MODELLER program.

Most of the insertion regions of the EGF receptor models were
constructed using the "loop" routine of version 4 of MODELLER (Modeler
User Guide, June 1997, San Diego Molecular Simulations Inc). Coordinates
for each insertion were built using one of the two models obtained in the
previous section as a scaffold. The regions of sequence for which coordinates
were built in this manner were 1-5, 8-12, 16-23, 46-51, 101-107, 145-148,
184-191, 241-262, 319-328, 522-530, 540-546, 578-600 and 612-621. Coordinates
for residues 351-368 and 387-393 were constructed simultaneously due to the
proximity of these regions in the model of the L2 domain. For each insertion,
50 models were constructed. In cases where the generated loops with the
lowest scores had similar backbone structures, the loop building process was
considered to have converged and the coordinates of the loop replaced those
of the same residues on the refined model. Where the loop structures did not
converge, the structures with the three lowest MODELLER loop scores were
evaluated using Procheck (Laskowski RA, MacArthur MW, Moss DS,
Thornton JM. (1993). J Appl. Crystallogr 26: 283-291), ProsaII (Hendlich M,
Lackner P, Weitckus S, Floeckner H, Froschauer R, Gottsbacher K, Casari G,
Sippl MJ. (1990) J Mol Biol 216:167-180.; Sippl MJ. (1993) Proteins 17:
355-362.) and Profiles-3D (Bowie JU, Lüthy R, Eisenberg D. (1991) Science
253:164-170.; Lüthy R, Bowie JU, Eisenberg D. 1992. Nature 356:83-85.). For
several of these loops, the one with the second lowest MODELLER score was
selected as it had a more favorable Profiles-3D and ProsaII plot.
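
One way to make the convergence test concrete is sketched below. The text
does not give a numerical criterion for "similar backbone structures", so
the number of loops compared (`n_best`) and the RMSD threshold (`max_rmsd`)
are hypothetical choices. No superposition is performed, on the assumption
that all candidate loops were built on the same scaffold and so share a
coordinate frame.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equal-length coordinate lists."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))


def loops_converged(scored_loops, n_best=3, max_rmsd=1.0):
    """True if the n_best lowest-scoring loops are pairwise within max_rmsd.

    scored_loops is a list of (score, backbone_coords) tuples.
    """
    best = sorted(scored_loops, key=lambda item: item[0])[:n_best]
    return all(
        rmsd(a[1], b[1]) <= max_rmsd
        for index, a in enumerate(best)
        for b in best[index + 1:]
    )
```

Where this test fails, the lowest-scoring candidates would be passed on to
independent structure validation tools, as described above, rather than
accepted on score alone.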

In order to retain certain secondary structures, additional restraints
were used in the construction of some of the loops. Restraints with the form
of a right-handed half-Gaussian function with a σ value of 0.05 Å were used to
hold selected mainchain N-O distances to 3.0 Å or less. The atom pairs for
which this additional restraint was added were: Gln 139.N - Gln 184.OE1, Val
268.N - Tyr 261.O, Val 268.O - Tyr 261.N, Ser 506.N - Ser 529, Ile 562.N - His
591.O and Glu 578.N - Val 592.

Structure of the EGF receptor model

The structure of the L1 and S1 domains of the EGF receptor as
determined by the modelling described above is shown in Figure 3, while the
structure of the L2 and S2 domains is shown in Figure 4. The superposition
of these two models onto the structure of the extracellular domains of the
IGF-1 receptor is shown in Figure 5.

The coordinates of the EGF receptor domains L1, S1, L2 and S2 are
shown in Figure 6.

Figures 7, 8 and 9 show the sidechains of residues of the EGF receptor
models which face the large cavity as shown in Figure 5. Figure 10 shows the
sidechains of residues of the face of the EGF receptor L2 domain which
contains the second beta sheet (the lower face of the L2 domain using the
orientation shown in Figure 4).

The structures of the L1 and S1 domains are similar to those of the
IGF-1 receptor structure, as expected. There are three major differences in the
S1 domain of the EGF receptor model from the structure of the IGF-1 receptor
cysteine-rich domain. The first module of the S1 domain is noticeably
smaller than that of the IGF-1 receptor CRD. The sixth module (Cys 271 - Cys
283) of the S1 domain is smaller than that of the IGF-1 receptor and occupies
less of the region between the L1 and L2 domains. The fifth module (Cys 240
- Cys 267) contains a large insertion which points away from the L1 domain.
The eighth module of the EGF receptor S1 domain (Cys 305 - Cys 309) and
the linker region (Arg 310 - Val 312) which follows it are similar in structure
to the analogous regions of the IGF-1 receptor. As in the IGF-1 receptor, the
linker region is postulated to be a hinge region about which the S1 domain
and the L2 domain can reorient.

A region of the EGF receptor in the L2 domain which could not be
aligned with the IGF-1 receptor includes the residues Trp 386 - Pro 387
which are conserved across the EGF receptor family. This sequence motif is
not found in the insulin receptor family and may represent a region of novel
structure.

The amino acids 352-367 correspond to a large insertion in the L2
domain of the EGF receptor. The amino acids 351-364 have been identified as
the epitope for several antibodies against the EGF receptor (Wu, D.G., Wang,
L.H., Sato, G.H., West, K.A., Harris, W.R., Crabb, J.W., Sato, J.D., 1989, J. Biol.
Chem. 264:17469-17475). That this region forms a loop which sticks out of the
surface is consistent with this region being accessible to antibodies. The
structure itself is difficult to model accurately since its sequence does not
correspond to any part of the IGF-1 receptor sequence. The position of this
insertion is in approximately the same region as where the IGF-1 receptor
differs in backbone structure.

The S2 domain model of the EGF receptor adopts a different
arrangement of modules and consequently a different shape to that of the CRD
of the IGF-1 receptor and the S1 domain model of the EGF receptor. The
disulfide bond arrangement is the same as that predicted by similarity to the
tumour necrosis factor receptor (Ward, C.W., Hoyne, P.A., Flegg, R.H., 1995,
Proteins 22:141-153) and has since been confirmed by mass spectroscopic
analyses of proteolytically digested EGF receptor extracellular domain (Abe,
Y., Odaka, M., Inagaki, F., Lax, I., Schlessinger, J., Kohda, D., 1998, J. Biol.
Chem. 273:11150-11157). The only significant contact of the S2 domain with
the L2 domain of the EGF receptor model is the intercalation of Trp 492 into
the L2 domain, analogous to that made by Trp 176 in the S1 domain of the
EGF receptor and Trp 176 in the CRD of the IGF-1 receptor to their respective
L1 domains. Unlike the S1 domain of the EGF receptor, the rest of the S2
domain does not make any contacts with the L2 domain. The S2 domain is
rod-like and points out from the L2 domain with a different geometry to the
manner in which the S1 domain points out from the L1 domain.
Putative binding sites of the EGF receptor

From the IGF-1 receptor structure and a number of insulin receptor
mutants, one of the regions of insulin binding was proposed to be the face of the
L1 domain which contains the second β-sheet (Garrett, T.P., McKern, N.M.,
Lou, M., Frenkel, M.J., Bentley, J.D., Lovrecz, G.O., Elleman, T.C., Cosgrove,
L.J., Ward, C.W., 1998, Nature 394:395-399). This surface is characterised by
a number of hydrophobic residues which point out of the structure and also
the presence of a structurally conserved loop. By analogy, we propose that
the analogous β-sheets of the L1 and L2 domains are potential binding sites. These
sheets contain a number of hydrophobic residues, conserved amongst EGF
receptor family members, which point away from the core of the β-helix
structure. Residue 45 of a mutant EGF has been cross-linked to the residue
Lysine 465 which is in the last strand of the lower β-sheet of the L2 domain
(Summerfield, AE et al. J Biol Chem, 1996, 271(33), 19656-19659). Tyrosine
101 has been cross-linked to the N-terminus of EGF (Woltjer, RL et al. PNAS,
1992, 89(16), 7801-7805). This residue is in the portion of sequence which
immediately follows a strand in the lower β-sheet of L1.

The side chain of asparagine 1 of EGF has been cross-linked to lysine
336 of the EGF receptor (Wu, DG et al. PNAS, 1990, 87(8), 3151-3155). The
latter residue is in the N-terminal helix of the L2 domain and points towards
the cavity which is formed when the two halves of the EGF receptor are
positioned in a similar arrangement to the first three domains of the IGF-1
receptor. Two nearby residues, Asn 328 and Asn 337, are glycosylated. This
mutation is in a similar position to the insulin receptor mutant S323L which
has aberrant insulin binding.

Several insertional mutants of the EGF receptor extracellular domain
were constructed to probe the role of several regions of the receptor (Harte,
M.T., Gentry, L.E., 1995, Arch Biochem Biophys 322:378-389). A number of
these mutants were not detectably secreted by the cells producing them,
suggesting that they did not fold to form stable proteins. Most of these
insertions were in positions which, according to the model structure, would be
unable to tolerate an insertion. In contrast, most of the other insertions were
in loops or other positions which, according to the model, are able to tolerate
insertions. EGF receptor extracellular domain mutants with insertions at
residues 162, 169, 174 and 220 bound EGF with a similar affinity to the wild-type
EGF receptor extracellular domain but bound TGF-α with a lower
affinity. The first of these insertions was located one residue before the last
cysteine residue of the L1 domain. The second and third insertions were
present in the first module of the EGF receptor S1 domain and the fourth was
present in the third module of the S1 domain. All of these positions are on a
side of the molecule far removed from the large cavity as shown in Figure 5.
EGF receptor mutants with insertions at positions 251 (in the fifth module of
the S1 domain) and 575 (in the sixth module of the S2 domain) appeared to
bind twice as much ligand as the wild-type receptor. Two insertional mutants
which showed reduced EGF receptor binding contained insertions at
positions 291 (in the seventh module of the S1 domain) and 474 (one residue
before the last cysteine of the L2 domain).

Another EGF receptor mutant which shows altered ligand binding
behaviour is the R497K mutant. The site of this mutation is in the first module
of the S2 domain and faces the side of the L2 domain opposite to that
containing residue 465. This mutant binds EGF in a similar fashion to the wild-type
receptor but abolishes the high affinity binding site for TGF-α (Moriai,
T., Kobrin, M.S., Hope, C., Speck, L., Korc, M., 1994, Proc Natl Acad Sci U S
A 91:10217-10221).

On the faces containing the second β-sheet (the lower face according to
the orientations shown in Figures 3 and 4) of the L1 and L2 domains are a
number of solvent-exposed hydrophobic residues including Tyr 64, Leu 66,
Tyr 89, Tyr 93, Leu 348, Phe 380 and Phe 412. According to a survey of
protein-protein interfaces, tyrosine, phenylalanine and leucine are more
likely to be involved in an interface than on the exterior of a protein complex
(Tsai C-J, Lin SL, Wolfson HJ, Nussinov R (1997) Protein Sci 6:53-64). Lys
465 is located on the lower face of the L2 domain and Tyr 101 is proximal to
the lower face of the L1 domain; both observations are consistent with the
lower faces of the domains having roles in ligand binding.
Strategies for developing EGF receptor ligands 

For several signalling systems, ligand analogues which have antagonist
properties have been described. These ligands include the human growth
hormone (Chen WY, Chen NY, Yun J, Wagner TE, Kopchick JJ (1994) J Biol
Chem 269:15892-15897), interleukin-6 (Savino R, Lahm A, Salvati AL,
Ciapponi L, Sporeno E, Altamura S, Paonessa G, Toniatti C, Ciliberto G
(1994) EMBO J 13:1357-1367) and interleukin-4 (Kruse N, Tony HP,
Sebald W (1992) EMBO J 11:3237-3244; Zurawski SM, Vega F Jr, Huyghe B,
Zurawski G (1993) EMBO J 12:2663-2670). The function of the unmodified
ligands is to bind their receptors and then subsequently recruit a second
receptor molecule. The mutations of the ligands mentioned above are in
positions which interfere with the binding of the second receptor (de Vos
AM, Ultsch M, Kossiakoff AA (1992) Science 255:306-312; Brakenhoff JP, de
Hon FD, Fontaine V, ten Boekel E, Schooltink H, Rose-John S, Heinrich PC,
Content J, Aarden LA (1994) J Biol Chem 269:86-93; Davis ID, Treutlein HR,
Friedrich K, Burgess AW (1995) Growth Factors 12:69-83).

To date, no analogues of EGF receptor ligands have been found which
are purely antagonistic. Whether EGF and its homologues have sites of
binding for two receptor molecules, like the proteins described above, has
not been shown. Analysis of ¹H NMR transferred nuclear Overhauser
enhancement data for titration of TGF-α with the extracellular domain of the
EGF receptor indicates that most parts of the ligand are in contact with the
receptor upon binding (McInnes C, Hoyt DW, Harkins RN, Pagila RN,
Debanne MT, O'Connor-McCourt M, Sykes BD (1996) J Biol Chem 271:32204-32211).
However, the concentrations used in the experiment were such that
the dominant receptor species was the ligand-receptor complex with 2:2
stoichiometry. Moreover, even if the ligands of the EGF receptor are buried in
the cleft formed by the first three domains of the receptor, it is difficult to
envisage that such binding will lead to contact with most of the bound ligand
when only one receptor binds the ligand. In an alternative scheme, at least
two separate faces on EGF are required to bind into the large cleft of a single
EGF receptor molecule, which enacts a conformational change in the receptor
which then allows it to dimerise. An antagonist may bind to the first binding
site of the receptor and not the second, thus preventing dimerisation and
subsequent signalling of the receptor. Thus, delineation of the parts of the
ligand involved in the (putative) primary and secondary binding faces would
greatly assist antagonist design.

Using the EGF receptor model and the known structures of EGF
receptor ligands, it may be possible to construct a model, or a partial model,
of ligand binding which could suggest which parts of bound ligand are
involved in binding to the first and second EGF receptors of the ligand-receptor
complex. There are several computer programs that can assist with
the construction of such models. Programs such as Quilt (Lijnzaad P, Argos P
(1997) Proteins 28:333-343; Lijnzaad P, Berendsen HJ, Argos P (1996)
Proteins 26:192-203; Lijnzaad P, Berendsen HJ, Argos P (1996) Proteins 25:389-397)
can be used to suggest sites on proteins involved in interactions with
other proteins. Possible structures of protein complexes can be obtained by
programs such as FT-DOCK (Gabb HA, Jackson RM, Sternberg MJ (1997) J
Mol Biol 272:106-120) and GRAMM (Vakser IA (1996) Biopolymers 39:455-464;
Katchalski-Katzir E, Shariv I, Eisenstein M, Friesem AA, Aflalo C,
Vakser IA (1992) Proc Natl Acad Sci U S A 89:2195-2199). The calculation of
electrostatic potentials from the Poisson-Boltzmann equation has been used
to investigate complexes made up of cytokines and growth factors and their
receptors (Demchuk E, Mueller T, Oschkinat H, Sebald W, Wade RC (1994)
Protein Sci 3:920-935) and may guide the construction of model complexes.
The construction of models will suggest regions of the EGF receptor ligands
which may be involved in receptor binding. With the model and supporting
experiments, it is envisaged that mutants of EGF and TGF-α will be
constructed which are potential antagonists.

The majority of targets for drugs which have made use of structural
information are enzymes. One advantage of enzymes over other types of
proteins is the presence of substrate-binding clefts whose normal function is
to bind small molecule substrates or short lengths of peptides. In contrast,
few small molecule inhibitors have been developed which inhibit protein-protein
interactions.

Desolvation of protein surfaces appears to be an important factor in the
formation of a protein-protein complex. Since, unlike the substrate-binding
clefts of enzymes, protein-binding surfaces tend to be much less concave, a
bound small molecule is unlikely to provide enough desolvation to enable
tight binding. The lower surfaces of the L1 and L2 domains, which have been
suggested to be involved in ligand binding, contain hydrophobic regions
which suggest that they need to be buried for strong binding of a molecule to
these surfaces to occur. We envisage that cyclic molecules, including cyclic
peptides, may be able to bind to such surfaces. Hydrophobic functional
groups may be chosen which, when bound to the hydrophobic regions of the
relevant face, desolvate regions of the protein. Some of the functional groups
which interact with the protein will be polar or charged to make favourable
electrostatic interactions. Other parts in the cyclic molecule may be polar or
charged to increase the aqueous solubility of the molecule. Cyclic molecules
also have the advantage of having few possible conformations when
unbound, providing a lower loss of entropy upon binding and thus tighter
binding as compared to a non-cyclic analogue. A degree of flexibility would
exist and would allow the molecule to alter its conformation to better
accommodate the protein it is binding to.
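
The entropy argument for cyclic molecules can be made explicit with standard thermodynamic relations. These are textbook expressions, not formulas given in this specification. The symbols are assumptions introduced for illustration: $\Delta G$, $\Delta H$ and $\Delta S$ are the free energy, enthalpy and entropy of binding, $K_d$ is the dissociation constant relative to a 1 M standard state, and the labels cyc and lin denote a hypothetical cyclic molecule and its non-cyclic analogue, assumed here to have the same binding enthalpy.

```latex
\begin{align}
\Delta G &= \Delta H - T\,\Delta S \\
K_d &= \exp\!\left(\frac{\Delta G}{RT}\right) \\
\frac{K_d^{\mathrm{cyc}}}{K_d^{\mathrm{lin}}}
  &= \exp\!\left(-\,\frac{\Delta S^{\mathrm{cyc}} - \Delta S^{\mathrm{lin}}}{R}\right)
  \quad \text{when } \Delta H^{\mathrm{cyc}} = \Delta H^{\mathrm{lin}}
\end{align}
```

Under these assumptions, a cyclic molecule that loses less entropy on binding ($\Delta S^{\mathrm{cyc}} > \Delta S^{\mathrm{lin}}$) has the smaller dissociation constant, that is, the tighter binding.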

Algorithms such as LUDI (Böhm HJ (1992) J Comput Aided Mol Des 6:
593-606) can be used to search for functional groups and molecular moieties
which may interact with a surface of the EGF receptor model. Algorithms
such as CLIX (Lawrence MC, Davis PC (1992) Proteins 12:31-41) or DOCK
(Kuntz ID, Blaney JM, Oatley SJ, Langridge R, Ferrin TE (1982) J Mol Biol
161:269-288) can be used to search a database of molecular structures for
those which have shape and/or chemical complementarity to the EGF
receptor. Computational combinatorial design algorithms (Miranker A,
Karplus M (1991) Proteins 11:29-34; Eisen MB, Wiley DC, Karplus M,
Hubbard RE; Caflisch A (1994) Proteins 19:199-221) can also be tried. In one
instance, a combinatorial approach has been used to design peptides to
inhibit the interaction of the proteins Ras and Raf (Zeng, J, et al. Protein
Engineering, to be published).
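
The idea of searching a database for structures with shape complementarity to a receptor surface can be illustrated with a deliberately simple sketch. It is not the algorithm of LUDI, CLIX or DOCK. The scoring rule, the distance thresholds, the function names and all coordinates below are hypothetical choices made for illustration only: each candidate atom scores +1 if it lies within a contact window of the nearest receptor atom, and is penalised if it lies closer than a clash distance.

```python
import math

def complementarity_score(receptor, ligand, clash=3.0, contact=4.5, penalty=3):
    """Toy score: +1 per ligand atom within contact range of the receptor,
    -penalty per ligand atom closer than `clash` angstroms to a receptor atom."""
    score = 0
    for atom in ligand:
        nearest = min(math.dist(atom, r) for r in receptor)
        if nearest < clash:
            score -= penalty      # steric clash with the surface
        elif nearest <= contact:
            score += 1            # favourable surface contact
    return score

def screen(receptor, database):
    """Rank named candidate structures by descending toy score."""
    scored = [(complementarity_score(receptor, atoms), name)
              for name, atoms in database.items()]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))

# Hypothetical data: a flat patch of four receptor atoms in the z = 0 plane
# and three made-up candidate structures.
receptor_patch = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0), (0.0, 4.0, 0.0), (4.0, 4.0, 0.0)]
database = {
    "snug": [(0.0, 0.0, 3.5), (4.0, 0.0, 3.5), (0.0, 4.0, 3.5)],
    "clashing": [(0.0, 0.0, 1.0), (4.0, 0.0, 3.5)],
    "distant": [(0.0, 0.0, 9.0), (4.0, 4.0, 9.0)],
}
ranking = screen(receptor_patch, database)
for score, name in ranking:
    print(name, score)
```

A practical program would also sample ligand orientations and conformations and would score chemical as well as geometric complementarity; this sketch evaluates each candidate in a single fixed pose.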

We envisage that as an alternative to a cyclic molecule, a small protein
could be used as a scaffold for placing amino acids that will interact with the
EGF receptor. At least one small protein (potato carboxypeptidase inhibitor)
with a fold different to that of EGF receptor ligands has been identified
which is a weak EGF antagonist (Blanco-Aparicio C, Molina MA, Fernandez-Salas
E, Frazier ML, Mas JM, Querol E, Aviles FX, de Llorens R (1998) J Biol
Chem 273:12370-12377). The use of a structural scaffold for proteins with
diverse functions has been observed in nature (Lin SL, Nussinov R (1995) Nat
Struct Biol 2:835-837). Other molecular scaffolds such as dendrimers may
also be considered which can be used to present the functional groups which
will tightly interact with the EGF receptor.

At least two, non-exclusive modes of action can be envisaged. The first
mode involves a molecule competing for binding sites with one of the EGF
receptor's natural ligands. Most likely, the molecule will prevent the receptor
dimerisation which is required for activation of the EGF receptor, thus acting
as an antagonist. We do not rule out the possibility that the binding may be
activating and the molecule acts as an agonist. The second potential mode of
action is for the molecule to bind to a site on the EGF receptor which is not
necessarily a ligand binding site. Such a molecule may be physically large
enough to hinder physical access of a second receptor to the receptor which
binds the molecule in question. This would hinder dimerisation and
subsequent activation of the receptor. If the molecule is sufficiently "sticky",
it may attract a second EGF receptor and induce dimerisation, thereby acting
as an agonist rather than an antagonist.

It will be appreciated by persons skilled in the art that numerous
variations and/or modifications may be made to the invention as shown in
the specific embodiments without departing from the spirit or scope of the
invention as broadly described. The present embodiments are, therefore, to
be considered in all respects as illustrative and not restrictive.
Claims:

1. A method of designing a compound which binds to a molecule of the
EGF receptor family and modulates an activity mediated by the molecule,
which method comprises the step of assessing the stereochemical
complementarity between the compound and a topographic region of the
molecule, wherein the molecule is characterised by

(i) amino acids 1-621 of the EGF receptor positioned at atomic
coordinates substantially as shown in Figure 6;

(ii) one or more subsets of said amino acids related to the coordinates
shown in Figure 6 by whole body translations and/or rotations; or

(iii) amino acids present in the amino acid sequence of a member of
the EGF receptor family, which form an equivalent three-dimensional
structure to that of the receptor site defined by amino acids 1-621 of the EGF
receptor positioned at atomic coordinates substantially as shown in Figure 6.

2. A method as claimed in claim 1 in which the topographic region of the
molecule is defined by amino acids 1-475 of the EGF receptor positioned at
atomic coordinates substantially as shown in Figure 6, or an amino acid
sequence which forms an equivalent three-dimensional structure to that of
the region defined by amino acids 1-475 of the EGF receptor positioned at
atomic coordinates substantially as shown in Figure 6.

3. A method as claimed in claim 1 in which the topographic region of the
molecule is defined by amino acids 313-621 of the EGF receptor positioned at
atomic coordinates substantially as shown in Figure 6, or an amino acid
sequence which forms an equivalent three-dimensional structure to that of
the region defined by amino acids 313-621 of the EGF receptor positioned at
atomic coordinates substantially as shown in Figure 6.

4. A method as claimed in any one of claims 1 to 3 in which the
compound is designed so as to complement the structure of a topographic
region of the molecule as depicted in Figure 5.

5. A method as claimed in any one of claims 1 to 4, in which the
compound has structural regions able to make close contact with amino acid
residues at the surface of the molecule lining the groove as depicted in Figure
7, Figure 8 or Figure 9.

6. A method as claimed in any one of claims 1 to 5, in which the
compound has a stereochemistry such that it can interact with both the L1
and L2 domains of the molecule.

7. A method as claimed in any one of claims 1 to 5, in which the
compound interacts with the region of the L1 domain-S1 domain interface,
causing an alteration in the positions of the L1 and S1 domains relative to
each other.

8. A method as claimed in any one of claims 1 to 5, in which the
compound interacts with the hinge region between the L2 domain and the S1
domain causing an alteration in the positions of the L2 and S1 domains
relative to each other.

9. A method as claimed in any one of claims 1 to 5, in which the
compound interacts with the β-sheet of the L1 domain causing an alteration
in the position of the L1 domain relative to the position of the S1 domain or
the L2 domain.

10. A method as claimed in any one of claims 1 to 5 in which the
compound has a stereochemistry such that it can interact with both the L2
and S2 domains of the molecule.

11. A method as claimed in any one of claims 1 to 5, in which the
compound interacts with the hinge region between the L2 domain and the S2
domain causing an alteration in the positions of the L1 and L2 domains
relative to each other.

12. A method as claimed in any one of claims 1 to 5, in which the
compound interacts with the β-sheet of the L2 domain causing an alteration
in the position of the L2 domain relative to the position of the S2 domain.

13. A method as claimed in any one of claims 1 to 5, in which the
compound binds to a lower face containing the second β-sheet of the L1
and/or L2 domains, wherein the structure of the face is characterised by a
plurality of solvent-exposed hydrophobic residues.

14. A method according to claim 13, in which the hydrophobic residues
include:

(i) Tyr64, Leu66, Tyr89, Tyr93; and/or

(ii) Leu348, Phe380 and Phe412.

15. A method as claimed in any one of claims 1 to 14, in which the
stereochemical complementarity between the compound and the receptor 
site is such that the compound has a Kj for the receptor site of less than 

16. A method as claimed in claim 15 in which the IQ is less than 10"*M. 

17. A method as claimed in any one of claims 1 to 16 in which the 
compound is selected or modified from a known compound identified from a 
data base. 

18. A method according to any one of claims 1 to 17, in which the compound
has the ability to increase an activity mediated by a molecule of the EGF receptor
family.

19. A method according to any one of claims 1 to 18, in which the compound
has the ability to decrease an activity mediated by a molecule of the EGF
receptor family.

20. A method according to claim 19, in which the stereochemical interaction
between the compound and the molecule is adapted to prevent the binding of a
natural ligand of the receptor molecule to the receptor site.

21. A method according to claim 19 or claim 20, in which the compound has
a K, of less than lO'^M. 






22. A method according to claim 21, in which the compound has a K, of less
than 10"'M.

23. A method according to claim 22, in which the compound has a K, of less
than 10"V.

24. A computer-assisted method for identifying potential compounds able to 
bind to a molecule of the EGF receptor family and to modulate an activity 
mediated by the molecule, using a programmed computer comprising a 
processor, an input device, and an output device, comprising the steps of: 

(a) inputting into the programmed computer, through the input device,
data comprising the atomic coordinates of the EGF receptor molecule as shown
in Figure 6, or a subset thereof;

(b) generating, using computer methods, a set of atomic coordinates of a
structure that possesses stereochemical complementarity to the atomic
coordinates of a topographic region of the EGF receptor molecule as shown in
Figure 6, or a subset thereof, thereby generating a criteria data set;

(c) comparing, using the processor, the criteria data set to a computer 
database of chemical structures; 

(d) selecting from the database, using computer methods, chemical 
structures which are similar to at least a portion of said criteria data set; and 

(e) outputting, to the output device, the selected chemical structures 
which are similar to a portion of the criteria data set. 

25. A computer-assisted method according to claim 24, in which the method
is used to identify potential compounds which have the ability to decrease an
activity mediated by the molecule.

26. A computer-assisted method according to claim 24 or claim 25, which
further comprises the step of selecting one or more chemical structures from
step (e) which interact with the molecule in a manner which prevents the
binding of natural ligands to the molecule.

27. A computer-assisted method according to any one of claims 24 to 26,
which further comprises the step of obtaining a compound with a chemical
structure selected in steps (d) and (e), and testing the compound for the ability to
decrease an activity mediated by the molecule.

28. A computer-assisted method according to claim 24, in which the method
is used to identify potential compounds which have the ability to increase an
activity mediated by the molecule.

29. A computer-assisted method according to claim 28, further comprising the
step of obtaining a compound with a chemical structure selected in steps (d) and
(e), and testing the compound for the ability to increase an activity mediated by
the receptor.

30. A method of screening a putative compound having the ability to
modulate the activity of a molecule of the EGF receptor family, comprising the
steps of identifying a putative compound by a method according to any one of
claims 1 to 29, and testing the compound for the ability to increase or decrease
an activity mediated by the molecule.

31. A method according to claim 30, in which the test is carried out in vitro.

32. A method according to claim 31, in which the test is a high throughput
assay.

33. A method according to claim 30, in which the test is carried out in vivo.

34. A method as claimed in any one of claims 1 to 33 in which the molecule
of the EGF receptor family is selected from the group consisting of the EGF
receptor, ErbB2, ErbB3 and ErbB4.

35. A method as claimed in claim 34 in which the molecule of the EGF
receptor family is the EGF receptor.

36. A compound able to bind to a molecule of the EGF receptor family and to
modulate an activity mediated by the molecule, the compound being obtained by
a method according to any one of claims 1 to 35.

37. A compound which possesses stereochemical complementarity to a
topographic region of a molecule of the EGF receptor family and modulates
an activity mediated by the molecule, wherein the molecule is characterised
by

(i) amino acids 1-621 of the EGF receptor positioned at atomic
coordinates substantially as shown in Figure 6;

(ii) one or more subsets of said amino acids related to the coordinates
shown in Figure 6 by whole body translations and/or rotations; or

(iii) amino acids present in the amino acid sequence of a member of
the EGF receptor family, which form an equivalent three-dimensional
structure to that of the receptor site defined by amino acids 1-621 of the EGF
receptor positioned at atomic coordinates substantially as shown in Figure 6;

with the proviso that the compound is not a naturally occurring ligand
of a molecule of the EGF receptor family or a mutant thereof.

38. A compound as claimed in claim 37 in which the topographic region of
the molecule is defined by amino acids 1-475 of the EGF receptor positioned
at atomic coordinates substantially as shown in Figure 6, or an amino acid
sequence which forms an equivalent three-dimensional structure to that of
the region defined by amino acids 1-475 of the EGF receptor positioned at
atomic coordinates substantially as shown in Figure 6.

39. A compound as claimed in claim 37 in which the topographic region of
the molecule is defined by amino acids 313-621 of the EGF receptor
positioned at atomic coordinates substantially as shown in Figure 6, or an
amino acid sequence which forms an equivalent three-dimensional structure
to that of the region defined by amino acids 313-621 of the EGF receptor
positioned at atomic coordinates substantially as shown in Figure 6.

40. A compound as claimed in any one of claims 37 to 39, in which the

stereochemical complementarity between the compound and the molecule is 

such that the compound has a for the receptor site of less than 

10"^M. 

41. A compound as claimed in claim 40 in which the Kj is less than 10"*M.

42. A compound as claimed in any one of claims 36 to 41, wherein the
compound increases an activity mediated by a molecule of the EGF receptor
family.

43. A compound as claimed in any one of claims 36 to 41, wherein the
compound decreases an activity mediated by a molecule of the EGF receptor
family.

44. A compound as claimed in any one of claims 36 to 43 in which the
molecule of the EGF receptor family is selected from the group consisting of the
EGF receptor, ErbB2, ErbB3 and ErbB4.

45. A compound as claimed in claim 44 in which the molecule of the EGF
receptor family is the EGF receptor.

46. A pharmaceutical composition for preventing or treating a disease
which would benefit from increased signalling by a molecule of the EGF
receptor family, which comprises a compound as claimed in claim 42 and a
pharmaceutically acceptable carrier or diluent.

47. A pharmaceutical composition for preventing or treating a disease
associated with signalling by a molecule of the EGF receptor family, which
comprises a compound as claimed in claim 43 and a pharmaceutically
acceptable carrier or diluent.

48. A method of preventing or treating a disease which would benefit from
increased signalling by a molecule of the EGF receptor family which method
comprises administering to a subject in need thereof a compound as claimed
in claim 42.

49. A method according to claim 48 wherein the disease is selected from
wound healing and gastric ulcers.

50. A method of preventing or treating a disease associated with signalling
by a molecule of the EGF receptor family which method comprises
administering to a subject in need thereof a compound as claimed in claim
43.

51. A method according to claim 50 wherein the disease is selected from
psoriasis and tumour states consisting of cancer of the breast, brain, ovary,
cervix, pancreas, lung, head and neck, and melanoma, rhabdomyosarcoma,
mesothelioma and glioblastoma.

52. A method as claimed in any one of claims 48 to 51 in which the molecule
of the EGF receptor family is selected from the group consisting of the EGF
receptor, ErbB2, ErbB3 and ErbB4.

53. A method as claimed in claim 52 in which the molecule of the EGF 
receptor family is the EGF receptor. 

Figure 3

Figure 4

Figure 5
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ATOM 155 N GLU 21 
ATOK 155 CA GLU 21 
ATC»: 15^ CB GLU 21 
ATOM 158 CG GLU 21 
ATO.y 159 CD GLU 21 
ATW. 160 OEl GLU 21 
ATOM 161 0E2 GLU 21 
ATOK 162 C GLU 21 
ATOM 163 O GLU 21 
ATOM 164 N ASP 22 
ATOM 165 CA ASP 22 
ATOM 166 CD ASP 22 
ATOK 167 CG ASP 22 
ATOM 168 ODl ASP 22 
ATOM 169 0D2 ASP 22 
ATOM 170 C ASP 22 
ATOM 171 O ASP 22 
ATOM 172 N HIS 23 
ATOM 173 CA HIS 23 

ATOM 174 NDl HIS 23 

ATOM 175 NE2 HIS 23 

ATOM 176 CEl HIS 23 

ATOM 177 CD2 HIS 23 

ATOM 178 CG HIS 23 

ATOM 17 9 CB HIS 23 

ATOM 180 C HIS 23 

ATOM 181 O HIS 23 

ATOM 182 N PHE 24 

ATCW 183 CA PHE 24 

ATOM 184 CB PHE 24 

ATOM 185 CG PHE 24 

ATOM 186 CDl PHE 24 

ATCM 187 CD2 PHE 24 

ATOM 188 CEl PHE 24 

ATOM 189 CE2 PHE 24 

ATOM 190 CZ PHE 24 

ATOM 191 C PHE 24 

ATOM 192 O PHE 24 

ATOM 193 N LEU 25 

ATOM 194 CA LEU 25 

ATOM 195 CB LEU 25 

ATOM 196 CG LEU 25 

ATOM 197 CDl LEU 25 

ATOM 198 CD2 LEU 25 

ATOM 199 C LEU 25 

ATOM 200 O LEU 25 

ATOM 201 N SER 26 

ATOM 202 CA SER 2G 

ATOM 203 CB SER 26 

ATOM 204 OG SER 26 

ATOM 205 C SER 26 

ATOM 206 O SER 26 

ATOM 207 N LEU 27 

ATOM 208 CA LEU 27 

ATOM 209 CB LEU 27 

ATOM 210 CG LEU 27 

ATOM 211 CDl LEU 27 

AT<»1 212 CD2 LEU 27 

ATCW 213 C LEU 27 

ATWl 214 O LEU 27 

ATOM 215 N GLN 28 

ATOM 216 CA GLN 28 

ATOM 217 CB GLN 28 

ATOM 218 CG GLN 28 

ATOM 219 CD GLN 28 

ATOM 220 OEl GLN 28 

ATOM 221 NE2 GLN 28 

ATC»1 222 C GLN 28 

ATC»1 223 O GLN 28 

ATOM 224 N ARG 29 

ATOM 225 CA ARG 29 

ATOM 226 CB ARG 29 

ATOM 227 CG ARG 29 

ATOM 228 CD ARG 29 

ATOM 229 NE ARG 29 

ATOM 230 CZ ARG 29 

ATOM 231 NHl ARG 29 



61.184 -0.640 100.514 1.00 60.00 

60.471 -0.440 101.743 1.00 60.00 

61.314 0.262 102.821 1.00 60.00 

62.418 -0.629 103.393 l.OO 60.00 

63.461 -0.846 102.306 1.00 60.00 

63.930 0.172 101.730 1.00 60.00 

63.798 -2.029 102.033 1.00 60.00 

59.239 0.374 101.517 1.00 60.00 

58.224 0.142 102.170 1.00 60.00 

59.284 1.356 100.600 1.00 60.00 

58.127 2.184 100.414 1.00 60.00 

58.301 3.259 99.327 1.00 60.00 

59.249 4.315 99.878 1.00 SO. 00 

59.322 4.443 101.129 1.00 60.00 

59.909 S.007 99.058 1.00 60.00 

56.961 1.325 100.044 1.00 60.00 

57.073 0.405 99.235 1.00 60.00 

55.803 1.615 100.670 1.00 40.00 

54.586 0.B92 100.442 1.00 40.00 

56.474 -1.838 101.876 1.00 40.00 

56.905 -2.977 100.017 1.00 40.00 

57.266 -2.757 101.268 1.00 40.00 

55.817 -2.14B 99.817 1.00 40.00 

55.539 -1.442 100.947 1.00 40.00 

54.466 -0.431 101.225 1.00 40.00 

53.501 1.752 101.002 1.00 40.00 

53.574 2.979 100.966 1.00 40.00 

52.456 1.096 101.537 1.00 40.00 

51.390 1.781 102.206 1.00 40.00 

50.221 0.874 102.641 1.00 40.00 

50.686 -0.092 103.676 1.00 40.00 

51.333 -1.248 103.312 1.00 40.00 

50.456 0.154 105.011 1.00 40.00 

51,755 -2.140 104.269 1.00 40.00 

50.876 -0.735 105.971 1.00 40.00 
51.528 -1.885 105.600 1.00 40.00 
51.973 2.437 103.414 1.00 40.00 
51.413 3.393 103.945 1.00 40.00 
53.137 1.940 103.864 1.00 40.00 
53.809 2.441 105.027 1.00 40.00 
55.201 1.807 105.203 1.00 40.00 
55,972 2.301 106.441 1.00 40.00 
55.274 1.867 107.740 1.00 40.00 
57.450 1.880 106.387 1.00 40.00 
54.004 3.919 104.842 1.00 40.00 
53.972 4.680 105.808 1.00 40.00 
54.210 4.353 103.587 1.00 40.00 
54.437 5.732 103.244 1.00 40.00 
54.690 5.930 101,740 1.00 40.00 

55.877 5.257 101,350 1.00 40,00 
53.231 6.550 103.603 1.00 40.00 
53.340 7.745 103.875 1.00 40.00 
52.050 5.910 103.620 1.00 40.00 
50.763 6.522 103.842 1,00 40.00 
49.630 5.478 103.835 1.00 40.00 
49.489 4.723 102.500 1.00 40.00 
48.353 3.693 102.558 1.00 40.00 
49.347 5.696 101.321 1.00 40.00 
50.727 7.196 105.183 1.00 40.00 
49.977 8.151 105.379 1.00 40.00 
51.536 6.712 106.139 1.00 40.00 
51.542 7.178 107.498 1.00 40.00 
52.603 6.451 108.345 1.00 40.00 
54.035 6.691 107.858 1.00 40.00 
54.993 5.910 108.749 1.00 40.00 
56.205 5.934 108.543 1.00 40.00 
54.437 5.195 109.764 1.00 40.00 
51.825 8.649 107.578 1.00 40.00 
51.263 9.341 108.426 1.00 40.00 
52.677 9.183 106.688 1.00 40.00 
53.065 10.561 106.792 1.00 40.00 
54.017 11.010 105.670 1.00 40.00 
55.395 10.355 105.771 1.00 40.00 
56.453 10.977 104.857 1.00 40.00 
56.118 10.623 103.450 1.00 40.00 
56.967 10.980 102.442 1.00 40.00 
58.110 11.670 102.726 1.00 40.00 
Figure 6 (continued) 



WO 99/62955 



PCT/AU99/00420 



52/72 



ATOM 1115 CD LYS 455 81.875 50.551 60.724 1.00 20.00
ATOM 1116 CE LYS 455 80.957 51.378 61.626 1.00 20.00
ATOM 1117 NZ LYS 455 80.221 50.493 62.555 1.00 20.00
ATOM 1118 N LEU 456 84.673 46.372 61.758 1.00 20.00
ATOM 1119 CA LEU 456 84.270 45.300 62.627 1.00 20.00
ATOM 1120 C LEU 456 85.381 44.885 63.545 1.00 20.00
ATOM 1121 O LEU 456 85.180 44.757 64.752 1.00 20.00
ATOM 1122 CB LEU 456 83.834 44.040 61.859 1.00 20.00
ATOM 1123 CG LEU 456 82.556 44.232 61.021 1.00 20.00
ATOM 1124 CD1 LEU 456 82.773 45.256 59.896 1.00 20.00
ATOM 1125 CD2 LEU 456 82.016 42.887 60.510 1.00 20.00
ATOM 1126 N PHE 457 86.594 44.675 62.996 1.00 20.00
ATOM 1127 CA PHE 457 87.692 44.185 63.780 1.00 20.00
ATOM 1128 C PHE 457 88.572 45.337 64.119 1.00 20.00
ATOM 1129 O PHE 457 89.281 45.859 63.260 1.00 20.00
ATOM 1130 CB PHE 457 88.622 43.229 63.012 1.00 20.00
ATOM 1131 CG PHE 457 87.894 41.992 62.626 1.00 20.00
ATOM 1132 CD1 PHE 457 87.862 40.906 63.468 1.00 20.00
ATOM 1133 CD2 PHE 457 87.249 41.919 61.414 1.00 20.00
ATOM 1134 CE1 PHE 457 87.195 39.761 63.103 1.00 20.00
ATOM 1135 CE2 PHE 457 86.580 40.777 61.046 1.00 20.00
ATOM 1136 CZ PHE 457 86.552 39.695 61.891 1.00 20.00
ATOM 1137 N GLY 458 88.516 45.793 65.378 1.00 40.00
ATOM 1138 CA GLY 458 89.413 46.825 65.797 1.00 40.00
ATOM 1139 C GLY 458 90.750 46.212 66.045 1.00 40.00
ATOM 1140 O GLY 458 91.788 46.777 65.701 1.00 40.00
ATOM 1141 N THR 459 90.738 44.999 66.632 1.00 40.00
ATOM 1142 CA THR 459 91.936 44.364 67.094 1.00 40.00
ATOM 1143 C THR 459 92.968 44.320 66.017 1.00 40.00
ATOM 1144 O THR 459 94.062 44.855 66.195 1.00 40.00
ATOM 1145 CB THR 459 91.699 42.973 67.625 1.00 40.00
ATOM 1146 OG1 THR 459 92.918 42.422 68.100 1.00 40.00
ATOM 1147 CG2 THR 459 91.087 42.085 66.529 1.00 40.00
ATOM 1148 N SER 460 92.668 43.707 64.857 1.00 40.00
ATOM 1149 CA SER 460 93.716 43.681 63.885 1.00 40.00
ATOM 1150 C SER 460 93.211 43.055 62.631 1.00 40.00
ATOM 1151 O SER 460 93.008 41.845 62.562 1.00 40.00
ATOM 1152 CB SER 460 94.946 42.869 64.325 1.00 40.00
ATOM 1153 OG SER 460 95.932 42.890 63.306 1.00 40.00
ATOM 1154 N GLY 461 92.988 43.894 61.606 1.00 20.00
ATOM 1155 CA GLY 461 92.549 43.439 60.323 1.00 20.00
ATOM 1156 C GLY 461 93.676 42.723 59.644 1.00 20.00
ATOM 1157 O GLY 461 93.459 41.809 58.850 1.00 20.00
ATOM 1158 N GLN 462 94.924 43.132 59.934 1.00 20.00
ATOM 1159 CA GLN 462 96.068 42.589 59.258 1.00 20.00
ATOM 1160 C GLN 462 96.146 41.114 59.504 1.00 20.00
ATOM 1161 O GLN 462 96.552 40.356 58.625 1.00 20.00
ATOM 1162 CB GLN 462 97.398 43.204 59.729 1.00 20.00
ATOM 1163 CG GLN 462 98.619 42.625 59.012 1.00 20.00
ATOM 1164 CD GLN 462 99.864 43.306 59.562 1.00 20.00
ATOM 1165 OE1 GLN 462 100.784 42.650 60.044 1.00 20.00
ATOM 1166 NE2 GLN 462 99.901 44.663 59.474 1.00 20.00
ATOM 1167 N LYS 463 95.759 40.681 60.713 1.00 20.00
ATOM 1168 CA LYS 463 95.867 39.310 61.128 1.00 20.00
ATOM 1169 C LYS 463 94.938 38.418 60.357 1.00 20.00
ATOM 1170 O LYS 463 95.195 37.220 60.242 1.00 20.00
ATOM 1171 CB LYS 463 95.609 39.129 62.625 1.00 20.00
ATOM 1172 CG LYS 463 96.582 39.909 63.511 1.00 20.00
ATOM 1173 CD LYS 463 98.052 39.562 63.269 1.00 20.00
ATOM 1174 CE LYS 463 98.679 40.355 62.120 1.00 20.00
ATOM 1175 NZ LYS 463 98.788 41.784 62.493 1.00 20.00
ATOM 1176 N THR 464 93.814 38.949 59.837 1.00 20.00
ATOM 1177 CA THR 464 92.864 38.124 59.134 1.00 20.00
ATOM 1178 C THR 464 93.550 37.341 58.060 1.00 20.00
ATOM 1179 O THR 464 94.479 37.825 57.415 1.00 20.00
ATOM 1180 CB THR 464 91.764 38.900 58.472 1.00 20.00
ATOM 1181 OG1 THR 464 91.012 39.617 59.441 1.00 20.00
ATOM 1182 CG2 THR 464 90.863 37.917 57.709 1.00 20.00
ATOM 1183 N LYS 465 93.109 36.080 57.863 1.00 20.00
ATOM 1184 CA LYS 465 93.702 35.266 56.843 1.00 20.00
ATOM 1185 C LYS 465 92.636 34.900 55.865 1.00 20.00
ATOM 1186 O LYS 465 91.615 34.310 56.218 1.00 20.00
ATOM 1187 CB LYS 465 94.341 33.974 57.380 1.00 20.00
ATOM 1188 CG LYS 465 95.607 34.234 58.201 1.00 20.00
ATOM 1189 CD LYS 465 96.091 33.027 59.008 1.00 20.00
ATOM 1190 CE LYS 465 97.369 33.295 59.806 1.00 20.00
ATOM 1191 NZ LYS 465 97.091 34.252 60.898 1.00 20.00






ATOM 1192 N ILE 466 92.846 35.262 54.586 1.00 20.00
ATOM 1193 CA ILE 466 91.848 34.946 53.614 1.00 20.00
ATOM 1194 C ILE 466 92.517 34.355 52.417 1.00 20.00
ATOM 1195 O ILE 466 93.466 34.923 51.861 1.00 20.00
ATOM 1196 CB ILE 466 91.074 36.151 53.170 1.00 20.00
ATOM 1197 CG1 ILE 466 90.333 36.766 54.369 1.00 20.00
ATOM 1198 CG2 ILE 466 90.152 35.737 52.013 1.00 20.00
ATOM 1199 CD1 ILE 466 89.731 38.139 54.07n 1.00 20.00
ATOM 1200 N ILE 467 92.035 33.172 51.984 1.00 20.00
ATOM 1201 CA ILE 467 92.573 32.536 50.817 1.00 20.00
ATOM 1202 C ILE 467 91.536 31.584 50.304 1.00 20.00
ATOM 1203 O ILE 467 90.683 31.122 51.058 1.00 20.00
ATOM 1204 CB ILE 467 93.786 31.696 51.094 1.00 20.00
ATOM 1205 CG1 ILE 467 93.422 30.542 52.043 1.00 20.00
ATOM 1206 CG2 ILE 467 94.909 32.604 51.623 1.00 20.00
ATOM 1207 CD1 ILE 467 94.498 29.462 52.138 1.00 20.00
ATOM 1208 N SER 468 91.619 31.255 48.996 1.00 20.00
ATOM 1209 CA SER 468 90.783 30.284 48.343 1.00 20.00
ATOM 1210 C SER 468 89.360 30.746 48.165 1.00 20.00
ATOM 1211 O SER 468 88.467 29.925 47.970 1.00 20.00
ATOM 1212 CB SER 468 90.779 28.934 49.089 1.00 20.00
ATOM 1213 OG SER 468 90.070 27.948 48.353 1.00 20.00
ATOM 1214 N ASN 469 89.097 32.068 48.185 1.00 20.00
ATOM 1215 CA ASN 469 87.747 32.520 47.969 1.00 20.00
ATOM 1216 C ASN 469 87.572 32.802 46.507 1.00 20.00
ATOM 1217 O ASN 469 88.441 32.493 45.695 1.00 20.00
ATOM 1218 CB ASN 469 87.384 33.794 48.745 1.00 20.00
ATOM 1219 CG ASN 469 87.287 33.412 50.217 1.00 20.00
ATOM 1220 OD1 ASN 469 86.717 32.378 50.565 1.00 20.00
ATOM 1221 ND2 ASN 469 87.864 34.263 51.106 1.00 20.00
ATOM 1222 N ARG 470 86.410 33.377 46.130 1.00 20.00
ATOM 1223 CA ARG 470 86.132 33.660 44.748 1.00 20.00
ATOM 1224 C ARG 470 86.921 34.869 44.341 1.00 20.00
ATOM 1225 O ARG 470 87.221 35.734 45.161 1.00 20.00
ATOM 1226 CB ARG 470 84.637 33.931 44.482 1.00 20.00
ATOM 1227 CG ARG 470 84.275 34.019 42.999 1.00 20.00
ATOM 1228 CD ARG 470 82.772 34.139 42.737 1.00 20.00
ATOM 1229 NE ARG 470 82.588 34.216 41.260 1.00 20.00
ATOM 1230 CZ ARG 470 81.339 34.380 40.735 1.00 20.00
ATOM 1231 NH1 ARG 470 80.257 34.478 41.562 1.00 20.00
ATOM 1232 NH2 ARG 470 81.173 34.450 39.381 1.00 20.00
ATOM 1233 N GLY 471 87.287 34.952 43.042 1.00 40.00
ATOM 1234 CA GLY 471 88.067 36.057 42.562 1.00 40.00
ATOM 1235 C GLY 471 87.152 37.224 42.377 1.00 40.00
ATOM 1236 O GLY 471 86.076 37.104 41.795 1.00 40.00
ATOM 1237 N GLU 472 87.613 38.410 42.811 1.00 40.00
ATOM 1238 CA GLU 472 86.824 39.608 42.795 1.00 40.00
ATOM 1239 C GLU 472 86.359 39.843 41.394 1.00 40.00
ATOM 1240 O GLU 472 85.213 40.227 41.168 1.00 40.00
ATOM 1241 CB GLU 472 87.656 40.849 43.158 1.00 40.00
ATOM 1242 CG GLU 472 88.242 40.651 44.569 1.00 40.00
ATOM 1243 CD GLU 472 89.222 42.008 44.650 1.00 40.00
ATOM 1244 OE1 GLU 472 90.181 42.oie 43.832 1.00 40.00
ATOM 1245 OE2 GLU 472 89.027 42.895 45.523 1.00 40.00
ATOM 1246 N ASN 473 87.251 39.624 40.412 1.00 40.00
ATOM 1247 CA ASN 473 86.911 39.872 39.041 1.00 40.00
ATOM 1248 C ASN 473 85.811 38.947 38.630 1.00 40.00
ATOM 1249 O ASN 473 84.872 39.352 37.944 1.00 40.00
ATOM 1250 CB ASN 473 88.101 39.653 38.087 1.00 40.00
ATOM 1251 CG ASN 473 88.545 38.201 38.192 1.00 40.00
ATOM 1252 OD1 ASN 473 88.650 37.650 39.286 1.00 40.00
ATOM 1253 ND2 ASN 473 88.816 37.562 37.022 1.00 40.00
ATOM 1254 N SER 474 85.903 37.672 39.044 1.00 20.00
ATOM 1255 CA SER 474 84.930 36.687 38.670 1.00 20.00
ATOM 1256 C SER 474 83.616 37.035 39.287 1.00 20.00
ATOM 1257 O SER 474 82.566 36.879 38.667 1.00 20.00
ATOM 1258 CB SER 474 85.310 35.274 39.145 1.00 20.00
ATOM 1259 OG SER 474 84.316 34.339 38.752 1.00 20.00
ATOM 1260 N CYS 475 83.632 37.527 40.538 1.00 20.00
ATOM 1261 CA CYS 475 82.399 37.791 41.211 1.00 20.00
ATOM 1262 C CYS 475 81.eeo 38.870 40.491 1.00 20.00
ATOM 1263 O CYS 475 80.436 38.623 40.377 1.00 20.00
ATOM 1264 CB CYS 475 82.545 38.242 42.667 1.00 20.00
ATOM 1265 SG CYS 475 80.882 38.368 43.362 1.00 20.00
ATOM 1266 N LYS 476 82.387 39.678 39.979 1.00 60.00
ATOM 1267 CA LYS 476 81.730 40.949 39.291 1.00 60.00
ATOM 1268 C LYS 476 80.842 41.650 40.265 1.00 60.00






ATOM 1269 O LYS 476 79.807 42.2C'c 39.902 1.00 60.00
ATOM 1270 CB LYS 476 80.873 40.477 38.102 1.00 60.00
ATOM 1271 CG LYS 476 81.696 39. 35.918 1.00 60.00
ATOM 1272 CD LYS 476 80.866 39.20^ 35.876 1.00 60.00
ATOM 1273 CE LYS 476 81.658 38.812 34.626 1.00 60.00
ATOM 1274 NZ LYS 476 81.946 40.015 33.814 1.00 60.00
ATOM 1275 N ALA 477 81.240 41.631 41.547 1.00 60.00
ATOM 1276 CA ALA 477 80.531 42.330 42.571 1.00 60.00
ATOM 1277 C ALA 477 81.458 43.425 42.964 1.00 60.00
ATOM 1278 O ALA 477 82.140 43.988 42.107 1.00 60.00
ATOM 1279 CB ALA 477 80.270 41.476 43.822 1.00 60.00
ATOM 1280 N THR 478 81.434 43.762 44.271 1.00 60.00
ATOM 1281 CA THR 478 82.280 44.706 44.946 1.00 60.00
ATOM 1282 C THR 478 81.379 45.659 45.645 1.00 60.00
ATOM 1283 O THR 478 80.158 45.505 45.621 1.00 60.00
ATOM 1284 CB THR 478 83.241 45.491 44.091 1.00 60.00
ATOM 1285 OG1 THR 478 84.no 46.192 44.905 1.00 60.00
ATOM 1286 CG2 THR 478 82.449 46.475 43.218 1.00 60.00
ATOM 1287 N GLY 479 81.961 46.668 46.311 1.00 60.00
ATOM 1288 CA GLY 479 81.135 47.620 46.982 1.00 60.00
ATOM 1289 C GLY 479 80.339 48.282 45.917 1.00 60.00
ATOM 1290 O GLY 479 79.229 48.758 46.150 1.00 60.00
ATOM 1291 N GLN 480 80.915 48.323 44.705 1.00 60.00
ATOM 1292 CA GLN 480 80.274 48.950 43.593 1.00 60.00
ATOM 1293 C GLN 480 79.022 48.203 43.269 1.00 60.00
ATOM 1294 O GLN 480 77.964 48.807 43.111 1.00 60.00
ATOM 1295 CB GLN 480 81.150 48.968 42.329 1.00 60.00
ATOM 1296 CG GLN 480 80.508 49.691 41.145 1.00 60.00
ATOM 1297 CD GLN 480 80.495 51.178 41.465 1.00 60.00
ATOM 1298 OE1 GLN 480 79.907 51.606 42.458 1.00 60.00
ATOM 1299 NE2 GLN 480 81.164 51.991 40.604 1.00 60.00
ATOM 1300 N VAL 481 79.086 46.857 43.177 1.00 60.00
ATOM 1301 CA VAL 481 77.874 46.197 42.794 1.00 60.00
ATOM 1302 C VAL 481 77.040 46.010 44.018 1.00 60.00
ATOM 1303 O VAL 481 76.809 44.890 44.459 1.00 60.00
ATOM 1304 CB VAL 481 78.100 44.852 42.165 1.00 60.00
ATOM 1305 CG1 VAL 481 76.738 44.232 41.800 1.00 60.00
ATOM 1306 CG2 VAL 481 79.045 45.029 40.961 1.00 60.00
ATOM 1307 N CYS 482 76.550 47.117 44.594 1.00 20.00
ATOM 1308 CA CYS 482 75.700 46.981 45.737 1.00 20.00
ATOM 1309 C CYS 482 74.501 47.830 45.449 1.00 20.00
ATOM 1310 O CYS 482 74.309 48.262 44.314 1.00 20.00
ATOM 1311 CB CYS 482 76.306 47.484 47.059 1.00 20.00
ATOM 1312 SG CYS 482 75.422 46.745 48.459 1.00 20.00
ATOM 1313 N HIS 483 73.647 48.093 46.459 1.00 20.00
ATOM 1314 CA HIS 483 72.490 48.892 46.177 1.00 20.00
ATOM 1315 C HIS 483 72.962 50.278 45.890 1.00 20.00
ATOM 1316 O HIS 483 74.064 50.667 46.277 1.00 20.00
ATOM 1317 CB HIS 483 71.455 48.954 47.314 1.00 20.00
ATOM 1318 CG HIS 483 70.136 49.489 46.841 1.00 20.00
ATOM 1319 ND1 HIS 483 69.832 50.827 46.731 1.00 20.00
ATOM 1320 CD2 HIS 483 69.027 48.620 46.420 1.00 20.00
ATOM 1321 CE1 HIS 483 68.565 50.902 46.253 1.00 20.00
ATOM 1322 NE2 HIS 483 68.035 49.709 46.048 1.00 20.00
ATOM 1323 N ALA 484 72.129 51.056 45.177 1.00 20.00
ATOM 1324 CA ALA 484 72.479 52.397 44.810 1.00 20.00
ATOM 1325 C ALA 484 72.633 53.183 46.070 1.00 20.00
ATOM 1326 O ALA 484 73.512 54.036 46.182 1.00 20.00
ATOM 1327 CB ALA 484 71.397 53.086 43.963 1.00 20.00
ATOM 1328 N LEU 485 71.761 52.894 47.053 1.00 20.00
ATOM 1329 CA LEU 485 71.700 53.607 48.296 1.00 20.00
ATOM 1330 C LEU 485 72.942 53.404 49.104 1.00 20.00
ATOM 1331 O LEU 485 73.430 54.351 49.719 1.00 20.00
ATOM 1332 CB LEU 485 70.524 53.126 49.167 1.00 20.00
ATOM 1333 CG LEU 485 69.159 53.245 48.467 1.00 20.00
ATOM 1334 CD1 LEU 485 68.011 52.771 49.377 1.00 20.00
ATOM 1335 CD2 LEU 485 68.937 54.658 47.908 1.00 20.00
ATOM 1336 N CYS 486 73.484 52.168 49.133 1.00 20.00
ATOM 1337 CA CYS 486 74.625 51.904 49.965 1.00 20.00
ATOM 1338 C CYS 486 75.706 52.850 49.592 1.00 20.00
ATOM 1339 O CYS 486 75.889 53.172 48.420 1.00 20.00
ATOM 1340 CB CYS 486 75.216 50.493 49.810 1.00 20.00
ATOM 1341 SG CYS 486 74.121 49.204 50.457 1.00 20.00
ATOM 1342 N SER 487 76.435 53.359 50.600 1.00 20.00
ATOM 1343 CA SER 487 77.514 54.217 50.245 1.00 20.00
ATOM 1344 C SER 487 78.465 53.328 49.553 1.00 20.00
ATOM 1345 O SER 487 78.379 52.106 49.660 1.00 20.00